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PREFACE 


Volume 17 of the Annual Review of Sociology continues and extends several 
trends that the Editors and Editorial Committee members have attemped to 
foster in recent years. One set of chapters provides comprehensive surveys of 
recent research in well-established areas of sociology. For example, Kasarda 
& Crenshaw review recent studies of urbanization in the third world; Maynard 
provides an overview of the variety of work being conducted within 
ethnomethodology; Scott reviews recent comparative work on corporate pow- 
er networks; Frey & Eitzen bring us up to date on developments within the 
sociology of sports;-and Treiman, Ganzeboom & Ultee provide a com- 
prehensive survey of comparative research on intergenerational mobility. 

Taking a different tack, other chapters explore the complex in- 
terconnections between two or more areas of research. Examples of such 
bridging or linking chapters include Menaghan's chapter on work and family 
interaction; Padraza's review of research on women and migration; and Vega 
& Rumbaut's discussion of work on mental health and ethnic minorities. 

Another cluster of chapters is designed to bring general readers up to date 
on recent developments in specific methodologies—e.g. Dillman's review of 
mail survey methods—or particular theories—e.g. Agger's review of post- 
structuralism, postmodernism and critical theory. 

Still other chapters are designed to call attention to topics that are less well 
known or less highly institutionalized. Such topics are often in early stages of 
development or are of obvious importance but lack clearly defined bound- 
aries; they may cross-cut a number of research areas. Instances of such topics 
here include Burstein's chapter on policy domains, Chase's review of vacancy 
chain processes in various types of social structures, Kerzer's chapter on 
family inheritance patterns, and Modell & Haggerty's review of the social 
effects of war. 

We happily continue our tradition, instituted in 1986, of the prefatory essay 
authored by a distinguished sociologist. In this volume, Mirra Komarovsky 
reflects on the emergence and development of feminist scholarship within 
sociology. 

Where do these and the other topics come from and how are their authors 
selected? Currently there exist five sources of ideas for prospective ARS 
chapters. First, and most important in terms of the number of chapters 
generated, the Editors and the members of the Editorial Committee propose 


(continued) v 


vi PREFACE (continued) 


topics for review along with names of possible authors. Second, two "guest" 
committee members are invited to attend the annual Editorial Committee 
meeting and to suggest topics and authors. Third, a set of "corresponding 
members," sociologists residing in countries other than the United States or 
Canada, are asked to propose topics and authors. Fourth, previous authors of 
ARS chapters are asked if they have suggestions of other areas ripe for review, 
or, the fifth year after their chapter was published, if they believe that an 
updating of their topic is warranted. Finally we receive an increasing number 
of unsolicited suggestions for topics—individual social scientists proposing 
topics they would like to review. In the case of such "volunteered" sugges- 
tions, we request that the would-be author prepare a brief outline of the 
proposed chapter together with a brief bibliography of works proposed for 
review. 

These many sources currently produce well over one hundred proposals for 
topics each year. These proposals are reviewed and evaluated in a fascinating, 
frustrating, marathon meeting of the Editors, Editorial Committee, and Guest 
Members, usually held in February. (For a listing of those individuals 
responsible for the contents of volume seventeen, see p. ii.) Given the extent 
and diversity of input, and the need to be attentive to balance of coverage, no 
person has more than one or two of their own proposals approved. Typically 
only two or three of the volunteered topics are selected. 

Prospective authors are invited in March or April of each year to undertake 
reviews of the selected topics; their manuscripts are due September 1 of the 
following year. A very high proportion of those invited accept the invitation 
to prepare a chapter. All chapter drafts are subject to editorial review, often 
with the aid of external reviewers, and are subject to rejection. Many chapters 
undergo substantial revision following review. 

The selection and review process has undergone considerable change dur- 
ing our nearly two decades of operation. We believe that the current process 
works well, as demonstrated by the quality of the published reviews, but we 
also recognize that there is always room for improvement. We welcome your 
reactions, comments, and suggestions. 


The Editors and Editorial Committee 
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Abstract 


The emergence of the women's movement in the 1960s presents two chal- 
lenges to sociology. The more obvious task is the analysis of a social ` 
movement: How did private problems get transformed into a collective protest 
at that particular historical moment? This article, however, addresses itself to 
the contribution of a social movement to the sociology of sociology. Feminist 
sociologists, in representing a disadvantaged group, claim to look at society 
from a new angle of vision. What was the impact upon the discipline of 
sociology of this new perspective? Feminist criticism of mainstream sociolo- 
gy revealed not only vast lacunae in our knowledge but flawed interpretations 
of social phenomena. Feminist theoreticians have extended their criticism to 
some epistomelogical positions of contemporary American sociology. 

The purpose of this article is two-fold. It illustrates some contributions to 
sociology by feminist scholars (including a few precursors). Secondly, some 
changing trends in feminist orientations are discerned and analyzed from the 
perspective of the sociology of knowledge. 


INTRODUCTION 


The resurgence of the women's movement since the 1960s has been accom- 
panied by a considerable flow of publications by feminist scholars in the 
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social sciences, history, and the humanities. In addition to these disciplinary 
contributions, and associated with the proliferation of “Women’s Studies" in 
colleges and universities, this period witnessed an increase in interdisciplinary 
publications on gender. This literature is generally described by sympathetic 
writers as the "new feminist scholarship" or “the new feminist theory" to 
distinguish it from earlier feminist writings. 

The purpose of this essay will be clarified by a statement of its boundaries. 
Its theoretical thrust is not an analysis of the women's movement but a 
description of contributions to sociology that have emerged from feminist 
concerns. This focus rules out the interdisciplinary literature with its hope of 
eventually transcending individual disciplines within some single integrative 
paradigm. 

Even restricted to sociology, however, no single article can do justice to the 
vast new literature. Writings on feminist epistomology and its implications for 
methods of research fall outside the scope of this chapter. Likewise, the 
diversity of political orientations within the feminist movement—liberal, 
radical, neo-marxist, and others—surely would require a chapter of its own. 
My focus is on a few feminist contributions that refute an accepted sociologi- 
cal generalization, fill a lacuna in our discipline, or chart an original theoretic- 
al direction. 

What is a "feminist" sociological contribution? How does one distinguish it 
from other exogenous and endogenous changes in a discipline? After all, the 
resurgence of the women's movement since the 1960s occurred in response to 
various economic, social, and cultural changes in American society. These 
changes have had a direct influence upon the discipline of sociology, un- 
mediated by the women's movement. The engine of change may be even 
more complex. Once aroused, the women's movement may have triggered 
societal changes which in turn affected the field of sociology. To illustrate this 
complex nexus of changes, let us assume that Blau & Duncan were to publish 
a new edition of their 1967 American Occupational Structure. It is likely that 
the new edition would contain, unlike the original, some reference to women 
in the labor force. If so, would this revision be a response to feminist articles 
on intellectual sexism in current approaches to social stratification (Acker 
1973) or to the fact that for the first time women hold the majority of 
professional jobs in the country (an increase that in some degree may itself be 
attributable to the women's movement)—or to all these influences combined? 

Ihave followed a pragmatic strategy that comes close, I believe, to solving 
this problem. I have limited myself to a content analysis of contributions by 
self-styled feminists or sociologists who avow a general support for the 
women's movement. 

Finally, some fluctuating perspectives of feminist theory are not exempt 
here from a scrutiny of their possible existential roots and, as an independent 
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issue, their validity. The latter question inevitably confronts current polemics 
about models of verification. 


THE MISSING PROBLEM 


Put in the most general terms, the feminists made manifest a social problem 
that was invisible in mainstream sociology prior to the 1960s. This is not to 
disregard some historical precursors. The few scattered forerunners, writing 
between the 1930s and 1960s, are presently to be acknowledged here. But the 
overriding fact remains that neither the general sociology textbooks nor books 
on social problems or the family registered any concern with the “women’s 
problem" before the rise of new feminism in the 1960s. 

Merton reminds us that “popular perceptions are no safe guide to the actual 
magnitude of a social problem. Ill-understood but partly known processes of 
social perception involve patterned omitting, supplementing, and organizing 
of what is selectively perceived as the social reality" (Merton & Nisbet 
1971:811). It is now apparent that sociologists, whose function as scientists is 
to discern what Merton termed "latent" social problems, failed in this case to 
transcend popular perceptions. Apparently scientific objectivity with its 
“value free” ethic cannot always be depended upon to portray social reality 
accurately. 

The latent social problem here is one of social disorganization—the dys- 
functions of current social arrangements and accepted beliefs for other, still 
more central, values of our society. What is at issue is no longer some isolated 
legal or other inequity suffered by women, but the total system of gender 
stratification. The persistence of traditional differentiation of the social roles 
of men and women gives men more benefits, authority, power, and prestige. 
Leaving aside our historical past, such gender stratification which violates 
democratic ideals is no longer defensible or functional for contemporary 
society. As Merton observed: "Those occupying strategic positions of author- 
ity and power . . . carry more weight than others" as judges of what 
constitutes a social problem (Merton & Nisbet 1971:803). Male sociologists 
dominate the discipline, and they did not escape the distortions inherent in 
their superior status in the profession and in the society at large. Their 
indifference need not, at least at the beginning of the women's movement, be 
explained merely by vested interests. As W. J. Goode observed (1982:137), 
men, as other dominant groups, tend to assume that their superior accomplish- 
ments are not the result of social advantages but of inborn superiority. 
Moreover, as is generally the case, human beings take for granted the benefits 
derived from some social arrangement but are aware of its costs for them. It is 
likely that men viewed women as the more sheltered sex, free from male 
pressure to provide and to achieve. 
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If sociologists did not regard gender stratification as a social problem, some 
disadvantages women suffered were recognized. But mere recognition does 
not suffice for remedy if the disadvantages are relegated to the private domain 
or considered to be inherent in the human condition. 

Violence against women, in and outside the family, is one of the more 
dramatic and familiar cases in point. That physical abuse of women existed 
was generally acknowledged, but even the victims accepted their fate as an 
individual problem, to be handled without invoking social sanctions. As to 
family sociology, O'Brien (1971) noted that in the index for all editions of the 
Journal of Marriage and the Family from its inception in 1939 through 1969, 
not a single article can be found that contained the word “violence” in the 
title. The imperative to protect family privacy could not account for this 
omission since sexual and other sensitive aspects of family relations were 
widely studied. 

The present concern with rape and other forms of physical abuse of women 
extends far beyond academic publications to national media and has led to 
some legal reforms. Absent the feminist movement, such developments 
would hardly have taken place. “In less than ten years, wife beating has been 
transformed from a subject of private misery and shame to an object of public 
concern" (Tierney 1982:210). 

Feminist criticism does not stop with identifying the failure of mainstream 
sociology to perceive the status of women as a social problem. Sociologists 
were equally blind to gender as a sociological problem. In a society in which 
men dominate the major institutions and make the critical decisions, it is all 
too easy to assume that the behavior of men, and not of women, is the more 
significant object of study. Hence, concerns of women tended to be either 
invisible to male sociologists or to appear trivial. Insofar as this criticism is 
valid, such a one-sided perspective was bound not only to leave lacunae but to 
result in actual distortions in the sociological analysis of society. 

Sociology textbooks on social problems and general sociology or on the 
family were slow to include gender issues even after the few precursors might 
have alerted their authors to these. The lag between the precursors and the 
standard textbooks can be inferred from the dates of some illustrative publica- 
tions. I shall first illustrate the invisible problem. No reference to women's 
problems can be found in the following books: Merton, Broom, & Cottrell, 
eds., Sociology Today (1959); Lipset & Smelser, eds., Sociology: The Pro- 
gress of a Decade (1961); Merton & Nisbet, eds., Contemporary Social 
Problems (1961, 2nd ed 1966); Gouldner & Gouldner, Modern Sociology 
(1963); Faris, ed., Handbook of Modern Sociology (1967); and Horton & 
Hunt, Sociology (1968). 

Equally silent on such issues were the following family texts: Bernard, 
American Family Behavior (1942); Nimkoff, Marriage and the Family 
(1947); Winch, The Modern Family (1953); Kephart, The Family, Society, 
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and the Individual (1961); Christenson, Handbook of Marriage and the 
Family (1964); and Kenkel, The Family in Perspective (1966). Burgess, 
Locke, & Thomes, The Family From Institution to Companionship (1963), 
continued to celebrate the allegedly egalitarian type of American marriage. 
But the text did not question whether the traditional division of labor was 
compatible with equality between the sexes. As we shall see in the following 
pages, some family sociologists not only failed to initiate any concern with 
the women's problems, they criticized the few feminist precursors. 


THE PRECURSORS: 1930-1960 


Feminist literature has a long history, but for our purposes the immediate 
precursors of the "new" feminist scholarship are the writers between 1930 and 
1960. 


Margaret Mead, Sex and Temperament in Three Primitive 
Societies (1935) 


Franz Boas in anthropology and William F. Ogburn in sociology battled 
against the prevailing biological determinism. In accounting for racial and 
class differences they were on the side of culture. But it was the student of 
both, Margaret Mead, who dramatized the thesis of cultural explanation of 
psychological sex differences in her description of three societies. In the 
preface to the 1950 edition of Sex and Temperament in Three Primitive 
Societies Mead states: 


I found in one [society], both men and women act as we expect women to act—in a mild, 
parental, responsive way; in the second, both act as we expect men to act—in a fierce, 
initiating fashion; and in the third, men act according to our stereotype for women—are 
‘catty’, wear curls and go shopping; while women are energetic, managerial, unadorned 
partners. (1950:1) 


I shall leave open the question of the reliability of the evidence. That all 
three logical options of psychological sex-typing should be discovered on a 
single field trip appears incredibly lucky. All the same, unfreezing biological 
determinism had profound theoretical implications. It opened up for social 
and cultural analysis a new problematic. How to account for the differences in 
definitions of femininity and masculinity and in sex roles across different 
societies and historical periods? How are men and women socialized to accept 
the prescribed patterns? What, if any, are the social provisions for the 
deviants? More broadly, what are the costs and benefits of these various 
arrangements for the society and the individuals? 

These issues have challenged and will continue to challenge sociologists 
whatever the fluctuating popularities of the nature vs nurture debates. 

In her 1949 book Male and Female, Mead moved away from the earlier 
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position of cultural determinism toward a modified view suggested by the 
very title of the concluding chapter: “To Both Their Own." This book was 
written at the height of what I termed the post-World War II "new- 
antifeminism" (Komarovsky 1953). Male and Female certainly reflects the 
influence of psychoanalytic theories. Its author seems to be torn by ambiva- 
lences. On the one hand, she recognizes that insofar as women are barred 
from “the great structures of law and government, religion, art, and science. . 
. Women become less human" (1949:381). She acknowledges the existence of 
overlap in psychological abilities of men and women. And yet, she writes of 
the risks of "bringing women into fields that have been defined as male . . . 
(because) this frightens the men, unsexes the women" (1949:379). This 
warning confuses short-range with long-range effects of changes in women's 
status. Ultimately, Mead's emphasis in this book falls upon the need to 
acknowledge and cultivate the unique talents of each sex. The evidence for 
the existence of these unique gifts is presented excursively, drawn in a general 
way from psychology, psychoanalysis, and conventional wisdom. 


Viola Klein, The Feminine Character: History of an Ideology 
(1946) 


This book is less widely known than Mead's contributions and requires a 
somewhat fuller treatment. With an introduction by Karl Mannheim, the book 
is undertaken as a study in sociology of knowledge. It examines the views of 
the feminine personality that were developed by eight authorities representing 
various disciplines, beginning with Havelock Ellis and Otto Weininger and 
ending with Margaret Mead and W. I. Thomas. 

All but the most recent writers conceived of human personality in bipolar 
terms; hence, they convey their views of both the female and the contrasting 
male personality. It is a salutary reminder for contemporary readers to review 
the long periods when women were thought to lack the essential qualities of 
humanity. 

The author concludes that as to the typical feminine personality "there are 
almost as many opinions as there are minds." This diversity of opinions is 
grist for the mill of any sociologist of knowledge. Klein makes a valiant effort 
to ground an author's view of femininity in the social context of the author's 
time and life. The introductory chapter on the historical background 
notwithstanding, Klein is more successful in establishing cultural than in- 
stitutional roots of the eight conceptions of femininity. By cultural I mean the 
philosophical and scientific currents accessible to a given author. Similarly, 
this cultural climate is, admittedly, explored more fully than the idiosyncratic 
personality of the writer. 

In addition to describing the nature of femininity as expounded by eight 
authorities, Klein raises more general epistomological issues. 
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The first concerns the question of objectivity in the social sciences. The 
book demonstrates, in the tradition of sociology of knowledge, that there can 
be no study free of a conscious or unconscious partial perspective on the part 
of the knower. The very choice of the subject, the formulation of the problem, 
the concepts in terms of which experience is perceived—all are affected by 
the scholar’s position in a specified society, as well as by the scholar's more 
personal life experience. s 

This being the case, can we ever arrive at the "truth"? Total relativism, 
which might appear as the inevitable end result of this analysis, is rejected by 
Klein. We come closest to "objective" truth when we lay bare the social and 
psychological roots of diverse perspectives and seek an ever richer integration 
of their partialities (Klein 1946:3). 

Another general conclusion is Klein's recognition of a shift away from the 
dominant bipolarity of sex stereotypes toward diversity. The industrial revolu- 
tion shifted hitherto familial economic functions to outside agencies and 
women entered the labor force. The general differentiation of society led to 
the recognition of diversity of influences shaping personality, apart from sex: 
*Social class, religious background, age, race, vocation, family relationships, 
early training, opportunities for development, social tradition, individual, 
physical and psychological dispositions, and others." The logical inference 

. is "the view that individual differences prevail over differences between 
whole groups" (Klein 1946:170). 

But what of the organic, biologically determined sex differences? Klein 
does not deny their reality. She proposes a sociological strategy. Let sociolo- 
gy continue to concentrate on sociopsychological sex differences. Biological 
determinants will fall into place as residual differences, those sociology 
cannot account for. Such a direction still strikes me as the primary sociologi- 
cal agenda. Nevertheless, biological sciences today provide more reliable 
methods of studying hormonal and other organic differences and similarities 
between the sexes. As reliable evidence accumulates, its relevance to socio- 
logical issues must be examined. 


Talcott Parsons, “Age and Sex in the Social Structure” 
(1942) 


Parsons in his famous article, *Age and Sex in the Social Structure" (1942), 
can hardly be called a precursor of feminist scholarship. He is included here 
as the prime target of feminist criticism in the 1960s and the 1970S.! 


!Not all challenges to Parsons' theory came from the feminist camp. Among others, Slater in 
1961 questioned the universality of role differentiation in the nuclear family along the in- 
strumental-expressive axis. Moreover, he argued that parental role differentiation, far from 
facilitating the child's identification with same-sex parent, tends to be dysfunctional for child 
development. 
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Contrary to some critics, this article did refer to women's role strains. But 
there is no denying that its central thesis was the congruence of sex role 
differentiation with the functional prerequisites of our social system. Parsons 
makes no reference to possible alternative social institutions. The very de- 
scription of women's role strains confirms this judgement. He states, "In a 
society where such strong emphasis is placed on individual achievement, it is 
not surprising that there should be a certain romantic nostalgia for the time 
(that is prior to her marriage) when the women's options were still open" (p. 
610). Doesn't the expression "romantic nostalgia" suggest a view of this 
problem as one of the inevitable pains of the human condition? 

To the best of my knowledge, however, the earliest radical departure from 
Parson's reference to women's "romantic nostalgia" was published in my 
article, “Cultural Contradictions and Sex Roles" (1946:184—89). The theme 
of that article had germinated in my mind long before I ever read Parsons and 
was not originally conceived as a critique of him. I am not sure that I 
generally endorse Dorothy Smith's (1979:163) notion of a distinctively femi- 
nist epistomology, but this article did have its origin in my personal experi- 
ence of conflicts. So grounded, my minihypotheses were subsequently tested 
in interviews with women undergraduates. 

That college women “played dumb" on dates, generally taken to be the 
import of that article, is not an adequate summary. I called attention to a 
distinction between two sets of values to which young women were exposed 
and by which they had to judge themselves. In an academically demanding 
women's college, students were rewarded for intelligence, initiative, in- 
dependence, articulateness, self-confidence, creativity, persistence—indeed, 
many of the valued human attributes expected also of men. However, in 
interaction with men women had to display these qualities cautiously and only 
so far as "the traffic will bear." Their male partners were spared the strain on 
dates of suppressing their proudest achievements. 

This 1946 article also identified the discontinuities in socialization of girls 
within the family and some cross-pressures from agents of socialization. 
Returning to Parsons—my final position was a sharp repudiation of his. I held 
that the dominant definition of sex roles, far from being functional, presented 
a case of social disorganization produced by the lead-lag pattern of social 
change. I quote my conclusion: "The problems set forth in this article will 
persist until the adult sex roles are redefined in greater harmony with socio- 
economic and ideological character of our society" (Komarovsky 1946:189). 


Mirra Komarovsky, Women in the Modern World: Their 
Education and Dilemmas (1953) 


This book was written at the height of what I termed the antifeminist wave 
that swept the country after World War II. It was not the women's movement 
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that motivated its writing but the conservative backlash. In 1945 Helene 
Deutsch developed, in a two-volume work, the Freudian thesis that "anatomy 
is destiny." In 1947, the psychiatrist Marynia Farnham described the plight of 
women as one of “overwhelming emotional catastrophy" and held the femin- 
ist movement responsible for the tragedy. In 1950, Lynn White, president of 
Mills College, in Educating our Daughters, accused women's colleges of 
educating women as if they were “men in disguise.” My book was written as a 
counterattack. Its core, however, was a portrayal in depth of a variety of role 
strains and of the serious discontent of college-educated, full-time homemak- 
ers, as well as the costs in our society of combining family life and a career. 
These social problems, I maintained, were not inescapable dilemmas of life, 
but problems potentially amenable to social control. The theoretical un- 
derpinnings of the book combined Ogburn's theory of social lag and Merton's 
variety-of-role theory. 

Reviewing Women in the Modern World in The Journal of Marriage and 
Family Living (February 1955:75) Kephart writes, "The women that Mirra 
Komarovsky has written about . . . seem to have little in common with the 
often-taunted, often-endeared, often-devoted women who comprise our 
wives, mothers, and daughters . . . (But) the clarity and vigor of presentation 
give the reader the overall impression that a fair-sized Social Problem is 
involved." In his own 1961 textbook on the family, Kephart gives short shrift 
to this particular social problem. If it exists at all, it is caused, Kephart claims, 
by the unwarranted low status accorded the housewife: "even though it may 
take a high level of skill to run a house properly" (Kephart 1961:238). 

Another family sociologist, writing as late as 1966, also failed to perceive 
any problematic features in full-time homemaking. William Kenkel asserts in 
his textbook that "from the standpoint of equipment and the time available for 
caring for children and a home, the American woman should be the envy of 
her sisters around the world" (Kenkel 1966:446). 

To the best of my knowledge only three exceptions to the treatment of this 
issue appeared in the family textbooks of the period. Folsom (1934) and 
Baber (1939) each included chapters on the changing status of women. The 
most extended discussion was in Kirkpatrick. He presented an extensive 
analogy between women and blacks, an analogy first noted by G. Myrdal 
(1944) and developéd by Hacker (1951). Above all, Kirkpatrick explored 
social inventions that might reduce the costs of combining work and family 
life. His early concern with women's issues was manifested in his book Nazi 
Germany: Its Women and Family Life, published in 1938. 

In 1936 Kirkpatrick published "The Measurement of Ethical Inconsistency 
in Marriage" in which he distinguished three sets of marital roles, each 
implying a distinctive cluster of privileges and obligations. Kirkpatrick was a 
pioneer in describing a completely egalitarian relationship in his "partner" 
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option. Radical as he was on this score, he would be faulted by the feminists 
in his description of the traditional “wife-and-mother” role. Despite the 
enumerated costs incurred by wives, his reference to the "ethical consistency" 
of the traditional marriage carried the implication of a fair balance of privi- 
leges and obligations—a conclusion that feminists challenged (Kirkpatrick 
1936:444—46). 


Alva Myrdal and Viola Klein, Women's Two Roles: Home 
and Work (1956) 


This is the first of the "precursor" works that provides a comparison of the 
status of women in four industrialized countries in the 1950s: United States, 
France, Sweden, and the United Kingdom (England and Wales). The wealth 
of demographic and economic data assembled here will continue to serve 
historians. The book's most significant contribution lies in the analysis both 
of the similarities and the differences in the status of women in the four 
countries. In each of the four countries women constitute about one third of 
the labor force despite differences in ideologies. 

The authors’ acknowledged purpose was to address what they perceived to 
be a social problem: "Under present conditions, with an average family of 
only slightly more than two children . . . an average housewife can be 
considered to be employed full-time on tasks which are necessary for home- 
making during only one quarter to one third of her normal adult life" (p. 12). 
The solution they advocated is not “blind alley" part-time jobs but a sequential 
pattern of returning to full-time employment when children no longer require 
their mothers' full-time care. 

Simone De Beauvoir's ground breaking The Second Sex falls outside this 
review of sociology books. 


SOME CONTRIBUTIONS OF FEMINIST SCHOLARSHIP 
SINCE 1960 


The body of feminist writings in sociology since the 1960s covers a vast range 
of substantive areas—women in economic, familial, legal, academic, politi- 
cal, and religious institutions, dual-career families, sexuality, mothering, 
housework, widowhood, divorce, stages in women's life cycle, violence 
against women, women and crime, comparative and historical studies of 
women's lives, and others. Apart from such substantive contributions, the 
more theoretical feminist literature deals with issues of epistomology and 
methodology, including both a critique of mainstream sociology and outlines 
of alternative philosophical and methodological positions. 

A sense of this voluminous literature is conveyed by the sheer size of 
bibliographies in recent overviews of the field. Epstein (1988:241—89) cites 
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over 500 authors, some with several books or articles. Hess & Ferree (1987) 
is second in the number of listed references. 

Given the limits of this paper I have chosen to select those feminist writings 
that reveal flaws in mainstream sociology. In some cases feminist scholarship 
raised significant theoretical questions previously missing from the sociologi- 
cal agenda. For example, insofar as the very phenomenon of gender stratifica- 
tion had been muted prior to the emergence of the women's movement, 
related theoretical problems were understandably neglected. One of them was 
the complex set of social mechanisms that maintained and reproduced that 
system. 

We now recognize that these mechanisms range from rules of intersexual 
social interaction (so taken for granted that only careful field observation, if 
not ethnomethodology, can bring them to light) to institutional controls, and, 
ultimately, to force. 

Equally scarce were comparative studies of class and other differences 
within our society or between nations that could account for differences in 
gender stratification. 

Other failings in mainstream sociology stemmed not from ignoring certain 
issues but from uncritically accepting others. Traditional stereotypes of 
female and male personalities were too frequently unquestioned. Con- 
sequently, whenever sex differences in social behavior appeared to fit those 
stereotypes, the curiosity of mainstream sociologists rested. By contrast, 
some feminist studies directed the explanatory analysis toward social structur- 
al differences, achieving more valid results. 

The general area within which the foregoing issues are examined is women 
in the labor market, more specifically, at the frontiers of change—the resis- 
tances to women's entry into male-dominated occupations. The major con- 
tribution of feminist scholars in this area lies in the analysis of forces 
undergirding gender inequality in the labor market. 

By and large the expansion of women's paid employment since the 1960s 
took place in a labor market strongly segregated by sex. Despite some 
penetration by women into male-dominated occupations, the range of 
women's occupations is much narrower than that of men's. Male occupations 
continue to be higher in prestige, pay, and power (Ferree 1987:325). 

The civil rights legislation of the 1960s and the 1970s and the women's 
movement exerted pressure to open up opportunities for women (and, of 
course, for blacks and other minorities). The depth of male resistance and the 
strategies employed to circumvent this pressure became the focus of feminist 
studies. 

A preliminary task for feminist scholars was a critical analysis of some 
theories in mainstream economics and sociology that attempted to account for 
gender inequalities in the labor market on grounds other than discrimination. 
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It can hardly be an accident that these "defensive" theories developed with the 
emergence of the civil rights and the women's movements.? 

One of these explanatory theories is the neoclassical economic theory of 
*human capital." It postulates that women (and men) make econornic choices 
rationally in order to maximize their utility. Given women's roles as 
homemakers, it is rational for women to select occupations that do not require 
much training or mobility, do not penalize part-time or intermittent work, or 
do not require the total commitment that male providers can make to their 
jobs. 

Another explanation stresses personality differences between the sexes. 
Some trace these differences to biological antecedents—many more see psy- 
chological gender differences as the result of differential socialization of the 
Sexes. According to this interpretation, the resultant strengths, deficiencies, 
and, indeed, preferences of women, in contrast to those of men, account for 
gender segregation and inequalities in the labor market. 

The human capital theory has been tested by comparison of male and 
female subgroups who share the same capital status, i.e. the same education, 
training, work experience, attitude toward work, and continuity of employ- 
ment. Bergmann summarizes the results of such statistical analyses. They 
account for about one half of the.gender gap. Bergmann concludes, “Women 
who make heavy investments in human capital receive a reward for that 
investment far inferior to the rewards men get" (1986:765).* 

The second theory, that of the differential socialization of the sexes, no 
doubt plays a part in their occupational segregation, but the problem is too. 
complex for precise evaluation. I return to it in a later section. 

The human captial and the socialization theories take as a given the existing 
social system. The question they implicitly address is narrow: Can occupa- 
tional stratification in our society be explained by factors other than dis- 
crimination against women solely on the grounds of gender? Such frozen 
parameters limit the scope of the theories. By contrast, feminist scholarship 
advanced our knowledge beyond those limits. Microsociological in- 
vestigations located social mechanisms and deliberate strategies that maintain 
(or change) the extent of occupational and, more generally, gender stratifica- 
tion. Macrosociological comparisons between nations study the effects of 
various structural and cultural factors upon the status of women. I shall 
presently use Ruggie's (1984) comparison between Britain and Sweden to 
illustrate such a macrosociological approach. 


?The recency of these “defensive” theories may be inferred from the fact that of 170 books and 
articles cited in Bergmann’s critique, only 7 were published prior to the 1960s (Bergmann 1986). 

4See also England & McCreary (1987:286-320) for a fuller treatment of “Gender Inequality in 
Paid Employment.” 
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Some feminist sociologists went further in challenging the frozen assump- 
tions of the standard theories. One of them is the identification of “work” with 
paid work, thus excluding much work done by women. Ferree cites recent 
statistics showing that both housework and child care "remain over- 
whelmingly the women's responsibility" (1987:338). Acker suggested that 
expanding the concept of work to the presently excluded categories of house- 
work and volunteer work would raise new theoretical questions applicable to 
both sexes. For example, “the workloads of all individuals could be described 
in terms of . . . movement (between) paid and unpaid activities . . . We might 
speculate that those who are denied the experience of both realms would have 
a narrower view of human capacity . . ." (Acker 1978:141). 

Turning to recent feminist works on women's penetration into male- 
dominated occupations, we find analyses of barriers to change. These works 
include corporate business (Kanter 1977), law (Epstein 1981), medicine 
(Lorber 1984), the academy (Chamberlain 1988), blue-collar jobs (Hess, 
Marx, & Ferree 1987), and a general overview (Bergmann 1986). 

My first illustration of the social mechanisms that reinforce the prevailing 
male advantages is drawn not from the workplace but from the college 
classroom—in a sense, preparatory to employment. The study, directly stimu- 
lated by the women's movement (Hall & Sandler 1982), reports behavior 
rooted in such deep-seated attitudes that its findings would probably surprise 
the subjects of the study. 

Following are some excerpts: 


Men students are more likely to be called upon, thus communicating more interest in what 
they have to say and increasing their visibility in the classroom. 

Faculty interrupt women students more than male students, displaying a lack of interest 
in what women have to say. 

Male students tend to be "coached" more than women by professors who probe for a 
more elaborate answer, with questions such as . . . "Can you explain that more fully?" This 
not only gives men students more encouragement and opportunity to develop their ideas, 
but also implies the expectation that they know the answer and just need to work at it a little 
harder... 

What men say carries more weight. A suggestion or point made by a man is more likely 
to be . . . credited to him (‘as Jim said’), developed further in discussion . . . than when the 
same suggestion is made by a woman. (1982) 


It is hard to imagine that such behavior on the part of the faculty would not 
reinforce the self-confidence of male and the self-devaluation of female 
students. 

Epstein’s Women in Law (1981) provides an incisive analysis of the 
processes that retard sex desegregation in male-dominated professions. The 
core problem takes us back to Merton’s analysis of status-sets as the array of 
social statuses occupied by an individual. In a differentiated modern society, 
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status-sets are likely to be complex. Thus, a male lawyer may be a husband, a 
father, a member of military, religious, charitable, political, and other in- 
stitutions. 

The complexity of status-sets tends to generate conflicts for the individual. 
The demands of role partners in various statuses may not only compete for 
time and energy but may.be intrinsically contradictory. Merton identified 
social mechanisms that assist an occupant of a given status to accommodate 
conflicting and contradictory demands (Merton 1968:422—38). 

One mechanism of articulating the various statuses is the accepted practice 
of activating the status appropriate to the situation. Two lawyers may be 
adversaries in the courtroom, but collaborators in the Bar Association, and 
friends in a social club. What, then, makes it so difficult for the men to 
activate the role of a colleague in relation to a woman lawyer and makes them 
likely instead to focus on her femaleness? 

Citing Merton, Epstein quotes: * ‘Sex-typing’, the labeling of social roles 
as appropriately performed by men or by women, creates ambiguity and 
ambivalence for those who choose a profession traditionally associated with 
the other sex" (Epstein 1981:276). This, however, is only a starting point. 
The interpretation appears to center around three elements. Sexual identity is 
a deeply internalized component of personality, linked to changing but still 
powerful stereotypes of female and male personalities as well as to sex roles. 

The second element is the difference between the professional role of a 
lawyer, on the one hand, and the feminine role, on the other. Epstein points 
out the woman-lawyer's double bind. Should she exhibit some traditional 
feminine trait, such as a lack of assertiveness, her male colleague will be the 
first to point out her unfitness for the profession of law. But a woman who is 
capable of the self-assurance and aggressiveness demanded and admired in a 
male colleague cannot win either. In violating the male ideal of femininity, 
she is judged to be hard, humorless, competitive—in a word, "masculine." 
Possibly, femaleness is a "master status" overshadowing any other in a 
woman's status set. If so, the woman colleague is seen first and foremost as a 
woman, thus activating a male-to-female orientation. I am inclined to think 
that the sexual component is not the primary element at work here. Despite 
risks of sexual harassment, women in subordinate positions have long been 
accepted. : 

The third element is the male socialization for superiority over women in 
the workplace. The sentiment that makes collegiality difficult is the hostility 
toward women who threaten male self-esteem and power as a colleague. In a 
competitive society, male colleagues expect to cope with the usual strains of 
competition. But a superior (indeed, even an equally good) performance of a 
female colleague is experienced by the male colleague as a more crushing 
defeat. Moreover, while the competition of other men is taken for granted, the 
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danger signal is sounded when a group defined as “the other" invades the turf, 
creating a larger pool of competitors. In the words of a college senior in my 
study (Komarovksy 1976:26): "When I went to take the law boards, I felt 
shocked to see all those girls . . . It's a threat to the security of all men in the 
profession." 

The deep roots of male resistance surfaced in my study of Columbia 
College seniors (Komarovsky 1976). The cultural milieu was liberal with 
sanctions against sexism. Possibly their youth made the seniors particularly 
vulnerable to perceived threats to self-esteem. On the other hand, they were 
yet to experience the full advantages of the superior power of male adulthood 
and could still afford to be idealistic. 

Whatever the assumptions, the interesting findings were not only a few 
direct expressions of sexism but the more prevalent ambivalence. Egalitarian 
pronouncements coexisted with a deeper layer of traditional attitudes. These 
inconsistencies occasionally caused guilt but much more frequently were not 
recognized. A senior declared full support for equal opportunities for women 
in professions and in business, adding a qualification in the course of the 
interview: 


A woman should not be in a position of firing an employee. It is an unpleasant thing to do. 
Besides, it is unfair to the man who is to be fired. He may be a very poor employee, but he 
is still a human being and it may be just compounding his unhappiness to be fired by a 
woman (p. 36). 


Some men endorsed goals for women while censuring the means necessary 
for their attainment. One such senior upheld the principle of free choice of 
professions for women, specifically including medicine. In a subsequent 
interview, in answer to our standard question about unfeminine behavior, he 
cited "too great a concern with grades." He found such concern repugnant 
also in a man. But he made allowances for a male premed student, anxious 
about admission to a good medical school, who would go to see a professor 
about a C on a chemistry test. In a girl, such an act would be *obnoxious" and 
he would be completely "turned off" by a girl who was similarly concerned 
about a poor grade (p. 27). 

I might add, parenthetically, that my follow-up studies of Columbia and 
Barnard students (Komarovsky 1985) testify to shifts in attitudes in the 
direction of egalitarianism. 

The early concentration on white professionals and the relative neglect of 
class, race, and ethnicity in feminist writings have been justly criticized, and 
there are signs of a more balanced future orientation. Ferree (1987:333—35) 
demonstrates that between 1960 and 1985 women had little success in 
penetrating skilled blue-collar jobs, traditionally dominated by men—much 
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less success than was the case in professions. The author goes beyond 
statistical facts and presents an illuminating interpretation of the differences 
between the two occupational categories. Who controls the access to elite 
professions as against skilled blue-collar jobs, how widely visible are the 
resistances, and consequently, how susceptible to public pressures, what are 
the differences in the ideological legitimation of sex segregation—these and 
other elements enter the interpretation of the relative failure to desegregate 
skilled blue-collar jobs. 

Another type of feminist contribution corrects a flaw in mainstream sociol- 
ogy referred to in the introduction to this section. A gender difference in the 
workplace was dismissed as unproblematical since a familiar "common 
sense" explanation was readily at hand. 

Many past studies attested to sex differences in attitudes toward work. The 
familiar results indicated that men were more task-oriented, work involved, 
and ambitious. Women, on the other hand, cared more about congenial social 
relations at work and were less motivated to seek advancement. These 
differences fitted gender stereotypes so closely that the curiosity of the 
researchers was satisfied. 

Going beyond the psychological stereotypes, Kanter examined social 
structural settings of work (1977:129-63). Drawing on past investigations, 
she cites findings that in production as well as in clerical jobs, women tend to 
have "shorter chains of opportunity" (p. 159). Opportunity structure, in turn, 
shapes behavior. People on “high mobility tracks" tend to develop work 
commitment and upward orientations. Conversely, cycles of disadvantage are 
also perpetuated: Lack of opportunity to advance limits or depresses aspira- 
tions and enhances the values of sociability of peer groups at work. 

This social structural interpretation is strengthened by studies of deviant 
situations. These are male workers with little chance of promotion and, 
conversely, women in jobs offering strong professional opportunities. The 
opportunity structure and not the sex of the worker proves the stronger 
determinant of behavior and attitudes on the job. 

The contributions of feminist scholarship in the foregoing pages (with the 
exception of the comparison between professional and blue-collar occupa- 
tions) were microsociological in scope. Mary Ruggie's book, The State and 
the Working Woman: (1984), serves as an illustration of a macrosociological 
study of differences in the status of women in two societies. This study is rich 
in descriptive data on women's employment and on the availability of social 
services such as child care, parental leaves, pay equity, occupational training, 
job sharing, and job placement in the two countries. More significantly, it 
offers a convincing interpretation of the differences. 

In principle, both Britain and Sweden attest to their goal of achieving sex 
equality. In fact, however, Britain lags strikingly behind Sweden. The un- 
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precedented increase in paid employment of mothers with young children 
failed to generate policies such as those enumerated above. By contrast, in 
Sweden the coalition of government, business, and labor resulted in a com- 
mitment to facilitate the entry of women into the labor force and to cope with 
its consequences. 

Ruggie sees the proximate causes of the differences between Britain and 
Sweden in the state-society relations. Ostensibly similar as these two “wel- 
fare” states are, the author spells out their differences. She defines Britain as 
closer to-the “liberal welfare” model of the state in contrast to the Swedish 
“corporatist” welfare model. In the liberal state, market forces are given freer 
play arid the state intervenes only when the market forces result in what 
society defines as intolerable inequities. “In the corporatist welfare model, the 
instruments of state interventions are designed to transform the market so as to 
avoid such imperfections” (Ruggie, p. 13). As Ruggie amplifies the dis- 
tinctions, “a corporatist welfare state intervenes to lead the market and to 
institutionalize social equality, not simply to compensate for inequality” (p. 
14). 

Probing deeper into the state-society relations, the author examines the 
position of labor in the two countries. Labor has become more strongly 
integrated into the governing coalition of Sweden, whereas the interests of 
labor in Britain are not on a par with other economic considerations (p. 18) 
(See also Epstein & Coser 1981). 

The foregoing section dealt with a small subfield of the now vast area of 
feminist scholarship. My purpose was to demonstrate the contribution this 
scholarship can make to the discipline by challenging the superficiality of 
some accepted answers and by posing significant new questions. 


FLUCTUATING PERSPECTIVES IN FEMINIST THEORY 


In this final section I discuss some shifting perspectives in feminist theory and 
examine some current polemics. 

Until the 1970s feminist writings stressed the impact of differential 
socialization upon adult personalities of men and women. Whether drawing 
upon the existing literature or charting new research, the underlying feminist 
quest was explicit: “What in socialization of children and adolescents might 
account for traits that are generally regarded as feminine deficiencies in 
comparison with men?” One example of such research is the study of prize- 
winning picture books for preschool children. The study revealed that 
“women are greatly underrepresented in the titles, central roles, and illustra- 
tions. Where women do appear, their characterization reinforces traditional 
sex-role stereotypes: Boys are active while girls are passive; boys lead and 
rescue others while girls follow and serve others. Adult men and women are 
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equally sex-stereotyped: men engage in a wide variety of occupations while 
women are presented only as wives and mothers" (Weitzman et al 1972:1—2). 
Little girls receive attention and praise for their attractiveness, while boys are 
admired for their achievements and cleverness. 

It is hard to escape the conclusion, comment the authors of the article, that 
such rigid sex role portraits in the children's books have a deleterious effect 
on the self-image and aspirations of women. Several other studies are cited 
showing that even young children know and express sex stereotypes. 

This preoccupation with differential socialization was no doubt motivated 
by the felt need to expose the social roots of "feminine deficiencies" and, 
more generally, the mechanisms of maintaining gender inequality. The poli- 
tical message was clear: "It is not in our genes." A related feature was the 
emphasis on considerable overlapping of scores on male and female psycho- 
logical tests. This overlapping was thought to be all the more significant given 
the tendency to publish positive rather than negative results in studies of group 
differences (for an early reference to overlapping, see Komarovsky 1953:20- 
24). 

With the passage of time we note, if not the muting of references to 
childhood and adolescent socialization, at least the emergence of a new 
theme. Feminist writers began to highlight the existing literature that con- 
firmed the new theme and also to undertake research specifically oriented to 
new hypotheses. "Yes," states the underlying new message: "Personality is 
shaped by early socialization, but let us not succumb to the doctrine of rigid 
determinism. It is time to recognize the potential for change as people move 
through life and encounter new opportunities and new rewards. This is 
especially applicable to women at a time of such rapid transformation in 
conditions and values affecting their lives." 

I do not claim that this emphasis on malleability of human personality had 
its origin in the women's movement. Too many investigators, unaffiliated 
with feminism, were working in this area, perhaps as a reaction against 
theories that regarded childhood experiences as lasting determinants of adult 
personality. This reaction may reflect a familiar pattern of intellectual de- 
velopment in behavioral sciences. À line of theory and investigation grows, 
becomes accepted, and eventually induces criticism. 

But if the controversy over relative consistency of lifetime behavior did not 
originate in the women's movement, the side taken by a number of feminist 
scholars in this debate appears to have an ideological component. It is not by 
chance that Epstein summarizes at length investigations that report changes in 
some personality traits with changes in social conditions (1988:137—40). This 
interpretation shifts the emphasis from negative socialization to blocked 
opportunities as the major problem for women. Epstein records the rising 
professional aspirations of women lawyers as changing conditions and their 
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own success served to stimulate their ambitions (see also Gerson 1985). I 
referred to an ideological component in the rejection of a doctrine of a rigidly 
fixed personality. The doctrines of rigidity, after all, serve to bolster the 
discrimination against women. Male gatekeepers could all too easily claim 
that reforms must await the emergence at some future time of a generation of 
women properly socialized to assume new challenges. The imputation of this 
political motivation in no way detracts from the potential scientific contribu- 
tion which must be assessed on its own merits. . 

The emphasis on malleability of personality fits into a larger theme of 
recent feminist writings in history and sociology. The broader purpose is to 
correct the invisibility of women's activities and to bighlight the extent to 
which women, though subordinate, were nevertheless active agents in shap- 
ing their lives and in the making of history (Thorne 1987:88—9). 

Finally, still another current of feminist theorizing is represented by such 
works as Rae Carlson, *Sex Differences in Ego Functioning: Exploratory 
Studies of Agency and Communion" (1971); Nancy Chodorow, The 
Reproduction of Mothering (1978); Jessie Bernard, The Female World 
(1981); Carol Gilligan, In a Different Voice (1982), Alice Rossi, Gender and 
Parenthood (1984), and Evelyn Fox Keller, Reflections on Gender and 
Science (1985). 2 

Whereas the previous writers attacked the traditional stereotypes of 
femininity, used to legitimize women's subordinate status, the third wave 
abandoned this defensive strategy. Instead, it challenged the value system of 
the dominant group and, in the redefinition of values, found much to honor in 
` some distinctively feminine attributes. 

This feminist orientation is exemplified by Gilligan's In a Different Voice 
(1982). I shall assume the reader's familiarity with Gilligan's ground- 
breaking distinction between the “ethic of justice,’ based upon abstract princi- 
ples, and the "ethic of care," based upon a sense of care and responsibility for 
the protection of relationships. "The morality of rights differs from the 
morality of responsibility in its emphasis on separation rather than connec- 
tion, in its consideration of the individual rather than the relatioüiship as 
primary . . ." (Gilligan 1982:18). 

Despite her statement that the "different" voice in the title was deliberate 
and was not intended to mean the “women’s” voice, it is difficult to accept 
this disclaimer. After all, Gilligan attributes the previous invisibility and 
devaluation of the "ethic of care" to Piaget, Kolberg, and others who regarded 
male moral development as the norm. Moreover, Gilligan turns to women in 
order to illustrate an ethic that differed from the "ethic of justice." 

Gilligan accuses her critics of simultaneously declaring her evidence in- 
adequate and her conclusions obvious (1986:325—26). Ironically, she may be 
right. As to evidence, her book does little more than illustrate the “ethic of 
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care." Nevertheless, the depiction of womén ‘as’ superior to men in empathy, 
nurturance, capacity and wish for close selationships—all resonate in our 
experience. 

But is this resonance anything more than our familiarity with the traditional 
stereotype of feminine virtues? And, apart from truth claims, does the wide 
appeal of Gilligan's book lie in a moral vision that promises some surcease 
from our conflict-ridden competitive society of self-centered, alienated in- 
dividuals? There are some policy implications in this celebration of feminine 
morality. It fosters a familiar appeal to women to improve our imperfect 
world through the exercise of the distinctively feminine virtues. In contrast to 
the nineteenth century, in the twentieth this moral redemption can no longer 
be located in the domestic sphere but must include the public spheres of the 
professions and politics. I wonder whethe- this redemptive mission can be 
assigned to the relatively powerless segment of the population. 

One parenthetical and somewhat ironic reminder: Talcott Parsons was a 
major target of feminist criticism. Parsons a»proached the distinction between 
instrumental and expressive roles from the point of view of the functional 
requirements of a social system. The family; as any social system, must solve 
the instrumental problems of adaptation to the environment on the one hand 
and of internally maintaining cohesion, selidarity, and integration, on the 
other. Within the family the instrumental functions are typically performed by 
men and expressive functions by women. It would appear that the psychology 
of Gilligan's women, however different in formulation, would uniquely equip 
them to perform Parson’s expressive function. 

Leaving aside broader issues of moral and political theory, Gilligan's 
insight points to two directions of investigacion. The first requires the testing 
of the insight in a broad variety of class, rzcial, ethnic, and other groups—a 
program Gilligan endorses. 

Optimally, the tests should be culturally equivalent. Kerber pointed out that 
the moral reasoning of women facing abortion must be compared to an 
equivalent struggle in men's lives; for example, one over draft resistance 
(Kerber 1986:305). Absent such comparisens, we cannot know whether an 
ethic of care is gender specific. Moreover tests of moral reasoning should 
ideally be supplemented by tests of behav or. 

This is one direction of research. Equally intriguing and complex is the 
search for an explanation of such gender differences in moral orientations as 
might be confirmed. Two brilliant recent contributions suggest two different 
explanatory frameworks. Chodorow (1978) presents a psychoanalytical 
theory of identity formation of the sexes -hat could lead logically to their 
distinctive moral perspectives. Hochschild's (1983) theory might also account 
for the two types of moral reasoning. The root cause of her explanation, 
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however, is the powerlessness of women. (“On Reproduction of Mothering: a 
Methodological Debate,” Signs 1981:482-514). 

To remind the reader, the foundation of Chodorow’s argument is the fact 
that mothers mother both sons and daughters. Girls, in identifying themselves 
as female, fuse the experience of attachment with the process of identity 
formation. For girls the self is “delineated through connection,” and they are 
threatened by isolation. Moreover, “Girls emerge with a stronger basis for 
experiencing another's needs or feelings as one's own . . .” (Chodorow 
1978:167). By contrast, boys define themselves through separation from their 
mother. Their development entails greater individuation, which is threatened 
less by isolation and more by intimacy. 

Hochschild (1983) defines *emotion labor" as an instrumental use of one's 
emotions in work or in private life. Using as basic data her study of flight 
attendants, Hochschild shows that it is what the author terms the “emotion 
labor" that is emphasized in selection and training of flight attendants. This 
labor must induce in the passengers “a sense of being cared for in a convivial 
and safe place" (p. 7). It demands on the part of the attendants "emotion 
work" in suppressing some feelings and expressing others in a manner that 
appears sincere. 

“Both men and women do emotion work in private life and at work . . . But 
in the whole realm of emotional experience, emotion work" (p. 162) plays a 
greater part in women's lives. To begin with, the author estimates that about 
one half of women but only a quarter of men are employed in jobs requiring a 
substantial use of emotional labor (p. 236). In both public and the private 
spheres women carry the heavier share. The major contribution of the analysis 
lies in linking women's larger share of emotion work to their subordinate 
status. 

Lacking the resources of money, power, authority, and status, women offer 
to men extra emotion work in exchange for sharing other resources: “They 
affirm, enhance, and celebrate the well-being and status of others” (p. 165). 
This may well include not only spontaneous feeling but “feminine wiles,” the 
managed expression of feelings for instrumental purposes. Women may 
acquire thereby the “secondary gains” of greater sensitivity to nonverbal 
communication and greater skills of manipulation in interpersonal relations. 

Chodorow and Hochschild open up new directions for inquiry. If women 
are indeed more empathetic than men, is it Chodorow's or Hochschild's kind 
of "empathy"? Chodorow claims that women tend to experience "another's 
needs or feelings as their own . . ." (Chodorow: 167). Hochschild sees 


*Gilligan in an earlier publication also hinted at women's lack of power as a possible element 
in the abortion study (Tronto 1987:649). 
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empathy as attentiveness to another's feelngs developed by a subordinate 
group as a survival skill. And, again, is women's alleged tendency to negoti- 
ate conflicts based on care or on the acquiced habit of the weak to “master 
anger—in the service of ‘being nice’ " (Hcchschild:163)? Safeguarding hu- 
man relationships is more important the more one's welfare depends on them. 

These seemingly competing interpretatiors are derived from different bod- 
ies of theory. But the issue need not remain z matter of individual preferences. 
Strategically designed studies may eventually yield much more complex 
patterns of interweaving psychoanalytic and social structural processes. 
Theoretical interest is exemplified by Chodorow (1989) and Alexander et al 
(1987). One recent line of investigation is promising. I refer tc studies of 
"men who mother,” being divorced or seperated, (Risman 1987) or fathers 
who share caretaking of young children (Coltrane 1988' and appended 
bibliography). The characteristics and consequences of shared parenting are, 
of course, also a major issue of social policy. 


AFTERWORD 


The shifts and varieties of feminist thought sketched above give rise to 
theoretical polemics. One such debate is between the maximalists and the 
minimalists. Among the maximalists some see psychological sex differences 
as rooted in biology; others claim that, whatever their origin, the differences 
are wide, significant, and resistant to change. The minimalists, on the other 
hand, point to wide variations of personality types within male and female 
groups and tend to attribute many existing differences to the structural and 
cultural characteristics of patriarchy. 

One element in this debate may be the underlying time frame. The present 
and the near future constitute the backgrourd of the maximalist position. In 
1953 I stated: “To be born a woman means to inhabit, from early infancy to 
the last day of life, a psychological world which differs from the world of 
men" (Komarovsky 1953:18). Much has chaaged since that time, but it is still 
unrealistic to deny that men and women cortinue to live in different worlds. 
Sociologists who undertake to study the distinctiveness of the female worlds 
(since these differ by class, race, ethnicity, sexual preference, and other 
features) make important contributions. 

The minimalists, I suspect, pose a different question, a narrower and a 
policy-oriented one. They do not cover the whole range of possible gender 
differences. Instead they inquire: "What particular differences tend to per- 
petuate women's narrower life choices anc their subordinate status in the 
public sectors?" 

I have taken a two-fold position on this issue. Equal opportunity for women 
in public spheres cannot be realized as long as we maintain gender-role 
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differentiation within the family with no options for individual preferences. 
Secondly, in order to provide real options for men and women we shall have 
to reorganize economic and other institutions in a profound way, more 
profound, in my opinion, than would be necessary, for example, to solve the 
problems of the black minority in the United States. 

We can glean the difficulties ahead by looking back at the zig-zag pattern of 
changes in women's status. Demographic, economic, and cultural social 
changes resulted in a series of lead-lag dislocations. Instead of moving toward 
a new equilibrium, the social reaction has repeatedly been a call for retrench- 
ment. The pressure to return to the past, idealized in retrospect, can be 
illustrated in the sphere of women's education. Higher education for women 
was a progressive move against much resistance: "The more abstract and 
severe branches of collegiate study will put too heavy a strain upon female 
health and vitality" (cited by Komarovsky 1953:37). Women met the chal- 
lenge of higher education, but neither the ideal of femininity nor the options 
for life-style patterns changed sufficiently to solve some problems for this 
group of women. 

Instead of a movement toward social innovations we witnessed a strong 
conservative post-World War II backlash. The backlash was fed by many 
influences, but higher education of women was a strong target of criticism. 
President of Mills College, Lynn White advocated a "distinctly feminine 
college curriculum.” It was certainly not “women’s studies" that White had in 
mind: "Would it be impossible to present a beginning course in foods as 
exciting and as difficult to work up after college, as a course in post-Kantian 
philosophy would be?" (1950:49). 

Turning to the future, it is still an open question whether pregnancy and 
lactation require that the nurturant function be performed exclusively by 
mothers and for what duration in infancy. As to men's capacity to share in the 
nurturance of infants, we will know only if society allows men, as ours does 
not, to develop whatever potential for nurturance they possess. The present 
chilly climate for such male functions was reported in a recent newspaper 
article. Several male kindergarten teachers found it expedient to describe 
themselves as elementary school teachers, fearing that the truth would arouse 
scorn and suspicion in encounters with strangers. 

Social investments in child care, maternity and paternity leaves, flexible 
work hours, job sharing, and other changes will be required to balance the 
private and the public worlds for both men and women. 

Such a vision would seem utopian were it not for the need of most women, 
single, married, divorced, with or without children, to work, and were it not 
for the social need for women workers. For a smaller segment of women in 
the labor force the non-economic rewards of an occupation are the major 
incentive. 
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It is true, that given the current political climate, a problem must reach 
crisis proportion before the state begins tc respond. The recent failure of the 
state to provide parental leave and adequate childcare exemplifies such de- 
layed action. 

The long-range outlook, I believe, is more optimistic. The demographic, 
economic, and cultural trends that are changing women’s lives and power are 
not likely to be reversed. In the long run, the persistence of traditional roles 
will create such stress and contradictions as to generate an irresistible pressure 


for social reorganization in the direction of sex equality. 
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Abstract 


The proliferation of research on the US labor movement has created oppor- 
tunities for the development of theory on labor action. Such theory would 
account for variations in the development, structure, ideology, goals, func- 
tions, social composition, and societal impact of labor unions and, more 
generally, the labor movement. This review of recent sociological research 
focuses on two sets of enduring questions to which this research has been 
addressed. The first concerns the development of the U.S. labor movement— 
that is, its connection to industrialization; its internal organizational and 
ideological development; and worker predispositions to unionize and conduct 
strikes. The second pertains to the socioeconomic and political impact of the 
U.S. labor movement—i.e. its impact on worker livelihoods and life chances 
and social inequality, as well as its universalizing effects on the determinants 
of inequality; and, its impact on working class political participation and the 
shaping of social policy. This review concludes with suggestions for redirect- 
ing research in order to realize the growing opportunities for developing 
theory on labor action. 


INTRODUCTION 


Over the last century, the labor movement has continued to be the chief effort 
made by workers to address problems associated with social inequality. 
Consequently, the development and impact of the labor movement on social 
inequality and politics pose several basic sociological questions. When during 
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industrialization does the working class unionize, and which social segments 
of this class are most likely to do so? Under which conditions do working 
class segments develop class-wide transfo-mative ideologies as opposed to 
segment-specific economistic ideologies? Which types of unions are most 
likely to bureaucratize and conform to Mickels’s "iron law of oligarchy”? Has 
unionism reduced socioeconomic inequality equally among different working 
class segments? Have unions substituted un versalistic forms of socioeconom- 
ic inequality for particularistic and ascriptive ones? Under which conditions 
do unions attempt to accomplish their goals in the private marketplace, 
through collective bargaining, or in the polity, by making legislative claims 
on the state? Why and when do unions comprising members of different 
working class segments make different legislative claims—i.e. claims for 
redistributive law versus law that legitimizes unionism and collective bargain- 
ing? Under which conditions is organized labor most successful in realizing 
its collective bargaining and legislative gcals? 

These are among the questions a socioloeical theory of labor action ought 
to address. Yet, little sociological research has been directed at developing 
such a theory—that is, a theory that address2s longitudinal and cross-sectional 
variations in the development, structure, xleology, goals, functions, social 
composition, and societal impact of labor unions.  ' 

This review of research on the US labor movement emphasizes the work 
published by sociologists since 1980!; it comsists of three sections. The first is 
a review of research on several facets cf the development of the labor 
movement. The second addresses the impact of unionism on diverse forms of 
socioeconomic inequality and politics. Finally, I conclude with some sugges- 
tions for redirecting sociological researcł in ways that will facilitate the 
development of a theory of labor action. 


THE DEVELOPMENT OF THE LABOR MOVEMENT 


As the labor movement emerged with industrialization, and as it has recently 
declined with “post-industrialization,” its Eleology, structure, and functions 
changed. Also, diverse workers have becone involved in the movement for 
different reasons. 


'The following is a partial list of journals and anaual book series which the reader should 
consult for the voluminous body of labor research which has been conducted by economists, 
historians, industrial relations experts, political scientists, psychologists, and others: Academy of 
Management Journal, Advances in Industrial and Labor Relations, Economic and Industrial 
Democracy, Industrial and Labor Relations Review. Industrial Relations, Journal of Labor 
Economics, Journal of Labor Research, Labor History, Labor Studies Journal, Monthly Labor 
Review, Monthly Review, and Research in Personne’ and Human Resources Management. 
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Industrialization and Unionization 


According to the classical Marxian and functionalist theories of unionization, 
unionization is generated spontaneously by the emergence of an inherently 
alienating, impersonal, and economically insecure large-scale capitalist, 
urban-industrial society.” The classical theories seemed to be consistent with 
the development of the US labor movement through the end of World War I. 
By that time the large corporation had emerged as the dominant economic 
institution, and unions had successfully organized heavy industry. By 1953, 
the percentage unionized had climbed to 42% in manufacturing, 65% in 
mining, 84% in construction, and 80% in transportation (Troy & Sheflin 
1985, ch. 3, p. 15). However, the classical theories did not anticipate the 
decline of US unionism during the post-World War II era. From its all-time 
high of 36% in 1945, the percentage unionized of the US non-farm labor force 
had fallen to 17% by 1989. By 1989, the percentage unionized in particular 
sectors had dropped to 22% in manufacturing, 18% in mining, 22% in 
construction, and 30% in transportation (US Bureau of Labor Statistics 
1979:507, 1990:232). 

Five explanations of post-World War II union decline address the loss of 
existing membership, obstacles to the recruitment of union members, or both. 
First, Bluestone & Harrison (1982) argue that geographical redistribution of 
capital into nonunion areas has caused plant shutdowns and union member- 
ship losses. Second, Lipset (1986) argues that the US culture of individualism 
and negative public opinion about unions increasingly disinclines workers 
toward unionization (Lipset & Schneider 1987). Third, several writers have 
argued that union decline has resulted from growing employer resistance to 
unionization (Cornfield 1986, Fantasia 1988, Goldfield 1987, Jacoby, 1990, 
Kochan et al 1986, Robinson & Mcliwee 1989)—Tresistance which has been 
encouraged and facilitated by the enactment of antiunion labor law, beginning 
with the Taft-Hartley Act of 1947 (Freeman & Medoff 1984:242-3, Huxley et 
al 1986). Fourth, post-World War II union decline has been attributed to the 
increased diversity in status of the pool of potentially organizable workers in 
the United States (Aronowitz 1983). Fifth, Bell (1973) argues that the shift 
from a blue-collar, goods-producing economy toward a white-collar, service- 
providing “post-industrial” economy has generated union membership losses 
as well as led to employment gains among such allegedly union-averse 
workers as white-collar and women workers (also, see Griffin et al 1990). 
However, in contrast to Bell’s argument, women and white-collar workers 
have increasingly unionized during the post-World War II era, glbeit ar lower. 





?See Comfield (1986) for a review of these theories. Also, Larson & "Níssen' s (1987) N 
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rates than the rate of decline of blue-coller union membership (Cornfield 
1987c, 1987d:7, Moore 1986, Troy & Sheflin 1985, ch.3:15). 

Virtually no research has compared the relative strength of the five explana- 
tions of union decline (Dickens & Leonard 1985). Furthermore, it is impor- 
tant to distinguish among industries, occupations, and worker subpopulations 
in assessing the strength of different causes of union decline because such 
causes are likely to vary among these groups, as are rates of union member- 
ship growth or decline (e.g. Kokkelenberg & Sockell 1985). 

The treatment of unionization in the claszical theories as a spontaneously 
generated process not only fails to address urion decline: it also overlooks the 
tremendous obstacles to unionization as well as the halting manner in which 
the US labor movement developed, especially compared to such development 
in social democratic nations (Griffin et al 1989). Applications of resource 
mobilization and neo-Marxian theories to the development of the US labor 
movement also indicate that, worker grievences notwithstanding, unioniza- 
tion is not a foregone conclusion (e.g. the May 1988 issue of Theory and 
Society, Billings 1982, 1990, Conell & Voss 1990, Griffin et al 1986, Jenkins 
1985:7, Katznelson 1986, Rubin et al 1982, Wallace et al 1989). 


The Ideological Development of the Labor Movement 


The ideological development of the US labo- movement is complex by virtue 
of the diversity of, and changes in, the ideologies espoused by different 
worker groups. Despite the diversity and dynamism of US labor ideology— 
i.e. the self-proclaimed societal role of labor organization—many have char- 
acterized the whole US labor movement as exceptional in its conservative 
“business unionism” and, unlike Western European labor movements, in its 
failure to develop a socialist philosophy amd a labor political party (Marks 
1989, Poole 1986). 

In presenting recent research on the ideological development of the US 
labor movement, it is helpful to distinguish between the three, ideologically 
divergent, labor federations that are commcnly regarded as constituting the 
modern labor movement: (a) American Feceration of Labor (AFL), 1886— 
1955; (b) Congress of Industrial Organizatons (CIO), 1936-1955; and (c) 
AFL-CIO, 1955-present. More recent research has been addressed to the 
AFL than to the other federations, and almost no research has been addressed 
to the AFL-CIO (Cornfield 1989b). 

AFL conservatism has been attributed to ethnic and racial divisions within 
the working class (e.g. Mink 1986) and to repressive state action against 
organized labor (e.g. Forbath 1989, Montgcmery 1987, Shefter 1986, Tom- 
lins 1985). However, analyses of central id2ological tendencies of the AFL - 
miss and obscure variations in ideology amoag the national union affiliates of 
the AFL (e.g. Haydu's 1988 analysis of machinists). 
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The CIO ideology was to the left of AFL ideology insofar as the CIO, in 
contrast to AFL voluntarism, strongly supported the development of a nation- 
al welfare state during the New Deal era. This ideological difference has 
largely been attributed to the lower skill level, and hence weaker market 
bargaining position, of the largely nonskilled factory worker CIO member- 
ship, compared to the skilled AFL craftsmen (e.g. Greenstone 1969, Marks 
1989, Quadagno 19882, Stevens 1986). 

Within the- CIO, some national unions were led by Communist party 
members. Variation among national unions in the likelihood of having Com- 
munist leadership has been attributed to the extent of capitalist resistance to 
unionism and to variations in the organizing strategies of unions and the 
Communist party (Kimeldorf 1988, Stepan-Norris & Zeitlin 1989). More 
research is needed on the impact on the development of Communist leader- 
ship of such worker social characteristics as ethnicity. 

What little research exists on AFL-CIO ideology suggests that it embodies 
traits from the AFL and CIO ideologies (Cornfield 1987d: ch. 1, 1989b). A 
small body of research based on recent surveys of individuals suggests that 
unions of the AFL-CIO era may promote a higher degree of class conscious- 
ness or identification with the “working class” in their memberships than that 
felt by the average nonunion worker (Form 1985:216-218, Simpson et al 
1988, Zingraff & Schulman 1984). Unresearched are any mechanisms—e.g. 
labor education programs conducted by unions—by which unions affect 
worker "consciousness"; variations among unions in consciousness-raising 
activities and effectiveness; which subgroups of union members are most 
receptive to class-based ideologies; and comparisons of the levels of such 
forms of nonclass consciousness as racism, sexism, and other types of 
prejudice and intolerance between union and nonunion workers. 


Individual and Group Predispositions to Unionize 


Much of the research conducted by industrial relations experts on individual 
and group predispositions toward unionism assumes that the ideology of the 
US labor movement is “business unionism” and, consequently, that workers 
are more likely to be prounion and to unionize, the more instrumentally 
valuable they perceive unions to be (Fiorito et al 1986, Kochan 1979). 
Approximately one third of nonunion workers would vote for unionization if 
an election were held in their workplace (Freeman & Medoff 1984:29). 
Prounion attitudes tend to be associated negatively with job satisfaction and 
positively with “union instrumentality.” Findings on the impact of de- 
mographic and socioeconomic status variables on union attitudes are in- 
consistent, except for identifying the persistent tendency of blacks to be more 
prounion than whites (Fiorito 1987, Fiorito et al 1986). The inconsistency of 
these findings may result from the uniqueness of the employment issues that 
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motivate a particular worker subpopulatioa such as white-collar workers to 
unionize (Kochan 1979). Klandermans's (1984) synthesis of the resource- 
mobilization and expectancy-value theories is similar to the concept of “union 
instrumentality." 

What little sociological research. there is on predispositions toward union- 
ism has tended to address individual differences in attitude within worker 
subpopulations. The most commonly resezrched worker subpopulations are 
professional workers, especially predominaatly female professions with some 
tradition of unionism (e.g. Falk et al 19€2, Gray 1989, Stern & Murphy 
1980); women clerical workers (e.g. Buron 1987, Cornfield 1987c); and 
Southern textile workers (e.g. Billings 1990, Leiter 1986, Reif et al 1988, 
Simpson 1981). Research-worthy worker subpopulations that have been ne- 
glected in sociological research include social classes as well as occupational, 
gender (Cook et al 1984, Fiorito & Greer 1986, Milkman 1985), and ethnic/ 
racial groups (Schutt 1987). National studies which address the unique issues 
that motivate worker subpopulations to urionize would also be a valuable 
addition to the sociological research literatire. Despite the strong impact of 
unionism on worker identification with the “working class" (Form 1985:216— 
218), the effects of class identification on -he propensity to unionize remain 
unresearched. 

Almost no research has investigated the- impact of diverse organizational 
forms—e.g. the conventional bureaucracy versus workplace democracy—on 
worker attitudes toward unionism. Rosenstein & French’s (1985) findings that 
workers in an employee-owned firm were more prounion the less they partici- 
pated in the ownership share plan suggest that unionization may be most 
compatible with the conventional, hierarckical bureaucratic workplace. 


The Organizational Development o? the Labor Movement 


The development and consequences of con-radictory democratic and bureau- 
cratic tendencies within labor unions are the enduring issues addressed by the 
voluminous research literature on the organizational development of labor 
unions (for reviews of this literature, see Cornfield 1987a:201—3, Edelstein 
and Warner.1979, Fiorito & Hendricks 1987, Poole 1981, and the May 1977 
issue of Industrial Relations). These issues are captured in Robert Michels's 
thesis on the "iron law of oligarchy," presented in his 1911 book Political 
Parties, which posits the inevitability of hierarchization of structure and 
oligarchization of leadership in otherwise fermally democratic organizations. 
This body of research may be divided according to the two subthemes of 
Michels's iron law: i.e. (a) structural charge and (b) the oligarchization of 
leadership. 
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STRUCTURAL CHANGE The little research on union hierarchization has 
mainly addressed the process by which geographically separate local unions 
of the same craft merged to form national craft unions during the nineteenth 
century. These pioneering longitudinal analyses, such as the work of John 
Commons and his associates, attribute union nationalization to the geograph- 
ical extension of product and labor markets (see Cornfield 1987a and 
1989a:31 for a review of these studies). Hannan & Freeman's (1989:201—293) 
population ecology thesis, in contrast, attributes the emergence of national 
unions to the changing density and magnitude of the resource base of the 
population of national unions. The market-extension and population ecology 
theses suggest the motivating and facilitating forces, respectively, for the 
appearance of national unions. No study has attempted to compare the relative 
strengths of, and relationships between, these forces. Also, virtually no 
research has compared the determinants of national craft unions with those of 
national industrial unions. More longitudinal studies of organizational struc- 
tural change are necessary to discern the impact of growth and changes in 
other factors, such as industrial, economic, technological, and political con- 
ditions on various dimensions of union structure, including centralization, 
control, and local union autonomy from the national union. Moreover, the 
recent decline of many national unions affords researchers the little used 
opportunity to conduct longitudinal analyses of the impact of union decline on 
structural change (e.g. Cornfield 1989a:170—175). 

Furthermore, post-World War II union decline has stimulated three struc- 
tural innovations beyond hierarchization which have received little sociolo- 
gical attention. The first is the growing number of mergers between unions 
(Chaison 1986). Underresearched are factors such as political conditions that 
may affect the timing of mergers, and factors such as ideology, union type 
(craft or industrial), membership growth trends, and other union traits that 
may affect union affinity toward other unions and union choices of specific 
merger partners. 

A second structural innovation which had accompanied recent union de- 
cline is the spreading of general unionism (Cornfield 1987a), i.e. the recruit- 
ment and organization of workers regardless of their craft or industry of 
employment. Little is known about which types of unions have engaged in 
general unionism, the impact of general unionism on the internal cohesiveness 
and governance of unions, and the effectiveness of general unionism as a 
growth strategy. 

The third structural innovation that has accompanied recent union decline is 
the beginning of associational unionism (Heckscher 1988, Cornfield, 1991), a 
hybrid of a union and an employee association. Differences between the two 
forms of employee organization are fuzzy. More research is needed on 
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differences in the development and nature oi unionism, associationalism, and 
associational unionism, as well as differences in the social characteristics of 
workers and issues which facilitate the development of each type of organization. 


OLIGARCHIZATION Much of the researck on union leadership has ques- 
tioned the theme of inevitable oligarchization in Michels’s iron law. Two 
types of research have addressed the conditions that promote and erode 
oligarchy by investigating the forces that gererate rank-and-file opposition to 
leaders. The first is a long and ongoing line ef research on the determinants of 
individual differences in union member act vism levels (for a review of this 
literature, see Cornfield et al 1990). Lacking theoretical coherence, this line 
of research has found that activism is correlated positively with socioeconom- 
ic status, ideology and class consciousness job satisfaction, and with work 
group homogeneity, among many other find ngs. Three recent research direc- 
tions in this research literature offer promisirg avenues for the development of 
relevant theory: (a) Klandermans' (1984) expansion of resource mobilization 
theory; (b) analyses of group differences, such as race and gender differences, 
in the determinants of activism (Cornfield et al 1990, Hoyman & Stallworth 
1987, Roby & Uttal 1988); and (c) the impact of competition between unions 
and other institutions such as the family, oa union activism (Cornfield et al 
1990). 

The second type of research on sources of rank-and-file opposition to union 
leaders concerns the genesis of organized competition for elective offices 
inside unions. Two branches of research havz predominated in this longstand- 
ing and voluminous research tradition. First, "structuralist" research, inspired 
by the pioneering work of Lipset et al (1956), has addressed the impact of 
formal union structure on electoral competition for union office (Edelstein & 
Warner 1979). Second, the “dynamic” approach consists of theoretically 
diverse, mainly longitudinal analyses that have linked the rise and fall of 
factionalism and opposition inside unions te the business cycle, intensity of 
union-employer conflict, union ideology, .eader-membership ties, and di- 
alectical internal union political processes (Cornfield 1989a, Nyden 1985, 
Schutt 1986). For example, Cornfield's (19892) status conflict theory of 
leadership change holds that lower status etanic and gender groups are most 
likely to succeed higher status groups in leadership positions in ideologically 
universalistic unions, in eras of high union-employer conflict and union 
jeopardization, and when the lower status group is perceived to be of strategic 
importance to the revival of union membe-ship growth. Other longitudinal 
organizational case studies of the underresearched topic of changes in the 
social composition of union leadership are necessary for testing the 
generalizability of the status conflict theorr. 
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Strikes are the chief union action studied by sociologists. Two questions have 
been raised about the determinants of strikes. The first concerns the impact of 
the state on labor militancy. The neo-Marxian thesis is the main one on this 
question, and it holds that the capitalist state tends to reproduce class relations 
by limiting labor militancy and, specifically, by regulating strikes (Fantasia 
1988, McCammon 1990). Unresolved is the debate over the relative im- 
portance of political versus economic determinants of changes in strike 
frequency (on this debate, see the July 1982 issue of the Industrial and Labor 
Relations Review). Korpi & Shalev (1980) attribute the relatively high volume 
of strikes in the United States to the relatively low levels of unionization and 
working class representation in the polity, although the US Bureau of Labor 
Statistics (1989:543) reports a dramatic decline in US strike activity during 
the 1980s. 

The second question is which workers are more or less inclined to strike. 
Research on interindustry variation in strike rates, highlighting facilitating 
conditions, suggests: from neo-Marxian and resource mobilization per- 
spectives that workers are more strike-prone, the more the organization of 
their industry incréases their capacity for disrupting production in their in- 
dustry or in other industries (Cornfield 1985, Wallace et al 1989). Strike 
research based on surveys of individuals suggests that the lower a worker's 
socioeconomic status is, and the stronger his/her employment-related griev- 
ances are, the more supportive of striking a worker tends to be (Cornfield, 
1991, Falk et al 1982, Hodson et al 1987, Schutt 1982). 

Several issues pertaining to strikes are worthy of further research: (a) the 
outcomes of strikes, especially as they are expressed in collective bargaining 
contracts, are a neglected object of research; (b) a synthetic analysis of the 
impact of industry and individual variables on the probability of striking 
would integrate facilitating and motivating determinants in strike theory; (c) 
analyses of which issues precipitate strikes, as opposed to other forms of 
unrest (e.g. quitting) or no unrest, and among which types of workers and 
unions would broaden and refine strike theory; and (d) studies of the impact of 
union variables—e.g. degree of internal democracy, ideology, centralization, 
age of union—on the probability of striking would incorporate the union as a 
central actor in strike theory, instead of continuing its conspicuous absence in 
strike theory. 

Finally, research on the actions of unions must be broadened to include the 
many nonpolitical actions other than striking that have been conducted by 
unions: e.g. collective bargaining, grievance handling, arbitration, union- 
operated welfare programs, labor banks, job referrals, etc. 
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THE SOCIOECONOMIC AND POLITICAL EFFECTS OF 
THE LABOR MOVEMENT 


Unions have attempted to improve worker “ivelihoods in the private market- 
place by organizing workers and engagirg in collective bargaining with 
employers. The results of these marketplace actions are the socioeconomic 
effects of unions. Unions also have sought -o improve worker livelihoods as 
well as to strengthen themselves as institutions by making claims—demands 
for legislation—on the state. The outcomes of these efforts constitute the 
political effects of organized labor. Virtually no research has addressed the 
conditions under which the labor movemen: acts in the marketplace or in the 
polity to accomplish its goals (see Kalleberg & Berg 1987 for a com- 
prehensive discussion of the impact of unions). 


Socioeconomic Effects of the Labor Movement 


The impetus to study the impact of unionism on socioeconomic inequality 
partly derives from the growing interest in tbe structuring of labor markets and 
their impact on the life chances of indiv:duals as Berg (1981:ch. 1) has 
shown. The chief outcome of collective bargaining is a contract—a set of 
written rules governing the workplace ard employment conditions. It is 
through the creation of rules that unions have structured the workplace and 
influenced worker life chances. With the exception of Stepan-Norris, & 
Zeitlin's (1991) innovative analysis of CIO contracts, few recent studies have 
given attention to contracts as a source of data on collective bargaining 
outcomes. Nonetheless, sociological research has addressed the union impact 
on workplace structure and on worker life chances. Few studies have tried to 
discern the confounding influence of "union threat effects" —the efforts of 
nonunion employers to preempt unionization by providing union-scale em- 
ployment conditions—on the impact of unionism on socioeconomic inequal- 
ity (Leicht 1989). 


WORKPLACE STRUCTURE Unions have attempted to alter the authority 
structure of firms in order to increase their participation in managerial deci- 
sion making, as well as to shape the rules by which craft and internal labor 
markets operate. 


Authority structure Unions have participazed to a limited extent in the few 
efforts to modify the conventional pyramida. authority structure of the bureau- 
cratic corporation.) Such efforts, variously termed “worker participation,” 


3See Cornfield (1987:ch. 16), Rothschild-Whitt (1924), and the November 1988 issue of Work 
and Occupations for reviews of the literature on worke- participation and the role of unions in the 
development of worker participation. 
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“employee involvement,” and "quality of worklife,” entail the redistribution 
of managerial authority at all levels of the business enterprise, but mainly on 
the shop floor (Heckscher 1988, Herrick 1990, Kochan et al 1984). Despite 
evidence suggesting that unionism facilitates its establishment (Cornfield 
1987d:ch. 16, Fenwick & Olson 1986), worker participation may be hindered 
by legal obstacles (Stern 1988). Given the important, if minimal, role of 
unions in the development of worker participation, future sociological re- 
search ought to examine variations over time and among unions, worker 
ideologies (e.g. worker control versus pragmatic ideologies), and corpora- 
tions in the conditions that promote worker participation. 

Another underresearched issue concerns the effectiveness of worker 
arficination: i e. da worker narticination schemes constitute a genuine de- 
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ment of internal labor markets, more research is needed on: (a) the relative 
strength of the impact of unions, managers. and the state on the development 
of internal labor markets; (b) which types of unions and which actual con- 
tractual arrangements made through collective bargaining contribute to the 
formation of internal labor markets; (c) the impact of unionization, or the 
threat of unionization, on the recent decline of internal labor markets and the 
development of labor externalization policies, such as the hiring of a con- 
tingent workforce (Pfeffer & Baron 1988), and (d) the impact of unionism on 
the establishment of due process and other formal rights in organizations 
(Dobbin et al 1988). 

Also deserving of more research is the ironic impact of unions on informal 
deviations from the formal internal labor market determined by collective 
bargaining. Finlay's (1988) analysis of longshoremen shows that workers and 
managers made informal “deals” about work hours and productivity that 
directly violated the union contract but that simultaneously served worker and 
company interests. Silver (1986:75—119) fo. ind similar informal arrangements 
in the building trades. Any research project on informal deviations by unions 
and companies from the formal internal labor market, especially projects that 
assess the limits of the impact of formel internal labor markets on life 
chances, would be a welcome addition to the literature. 


WORKER LIFE CHANCES Since Sgrensem's (1983) call for studies of the 
impact of unions on labor market outcomes or life chances, sociological 
research has addressed three relevant issues. The first is the effect of unions 
on life chances: i.e. are the life chances and socioeconomic status of union 
members (or workers covered by a collective bargaining agreement) superior 
to those of nonunion workers? It is typically argued, either from a neo- 
Marxian "worker solidarity" perspective or rom a pluralist resource mobiliza- 
tion perspective (e.g. Kalleberg et al 1981), that unionization strengthens the 
capacity of workers to improve their livelihoods and, therefore, that the 
livelihoods of unionized workers are superior to those of nonunion workers 
(see Freeman & Medoff 1984 for an economic argument). Unionism or labor 
militancy is positively associated with individual and aggregate income (e.g. 
Beck & Watson 1988, Freeman & Medoff 1984, Griffin et al 1989, Kalleberg 
et a] 1981, Rubin 1986, Wallace et al 1939). However, most sociological 
studies of union effects on life chances treat unionism as one of several 
institutional causes of inequality in life chences, rather than make unionism 
the central focus of the analysis. Consequently, the mechanism by which 
unionism could effect any specific labor narket outcomes, as well as the 
diverse union strategies for doing so, have been neglected both theoretically 
and empirically. 

The effect of unionism on fringe benefits, such as pensions and health 
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insurance, is ambiguous. Unionized workers are more likely than nonunion 
workers to be covered by various fringe benefit plans (Freeman & Medoff 
1984, Krecker & O'Rand 1991, O'Rand 1986, Perman & Stevens 1989, 
Wallace et al 1989). However, in their study of company pension plans, 
O'Rand & MacLean (1986) found that the lower benefits of pension plans in 
unionized companies seemed to be attributable to the wider age participation 
rules of these plans compared to the plans in nonunion companies. This 
suggests that unions may trade off pension benefits for wider accessibility of 
the pension plan to more workers. Also, Martin et al (1988) found that female 
labor leaders valued work-family policies more highly than did male labor 
leaders. This suggests the necessity of more research on the impact of 
unionism on the content of fringe benefits, in addition to fringe benefit 
Coverage rates, and on the actual bargaining goals and trade-offs used by 
different unions and union leaders in negotiations with employers about the 
content of fringe benefit plans (e.g. Quadagno 1988b). 

Research has uncovered inconsistent findings on the impact of unionism on 
job security and unemployment (Cornfield 1987b, D'Amico 1984, Freeman 
& Medoff 1984). Future research must carefully control for the many industry 
traits that may be correlated with the extent or presence of unionization and 
labor demand in the industry or firm and must also examine temporal varia- 
tions in the relationship between unionism and job security. Given that few 
unions directly control the number of layoffs, organizational case studies of 
layoffs at unionized firms would help to determine how, if at all, other union 
bargaining goals and contract provisions, including crew or work group size 
limitations, hours and wages, affect job security. Finally, future research 
must distinguish between different reasons for job loss—e.g. technological 
change (Cornfield 1987d), temporary layoffs during recessions, permanent 
layoffs resulting from a plant shutdown, and voluntary quitting—in order to 
discern any union effect on job security. Indeed, several studies indicate that 
unionism reduces voluntary quitting (Cornfield 1985, D'Amico 1984, Free- 
man & Medoff 1984:94—100, Halaby 1986), perhaps because unionized 
workers are more satisfied than other workers with their jobs (Pfeffer & 
Davis-Blake 1990). 

The few studies of the impact of unionism ón occupational employment 
segregation yield inconclusive findings (Bielby & Baron 1986, Bridges 1982, 
Kaufman 1986, Milkman 1987). Given that labor allocation is typically 
performed by management rather than by unions, more research is needed on 
how allocative provisions of collective bargaining agreements may constitute 
institutional discrimination against minorities and women and thereby create 
occupational employment segregation, as well as which of these provisions 
are favored more by unions or employers and how they differ from nonunion 
allocative arrangements. This research is necessary.to discern any impact of 
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unionism, as opposed to employer influences, on occupational employment 
segregation. 

An important but neglected research topic is the impact of unionism on the 
rates of inter- and intra-generational social mobility. If "blocked mobility" is 
one reason why workers unionize and is a factor in shaping working class 
political action, research on the impact of vnionism on social mobility could 
produce important insights into the capacity of the labor movement to repro- 
duce itself inter- and intra-generationally, end into the shaping of organized 
labor's political action. 

The second issue concerning unions and labor market outcomes is variation 
among worker subgroups in the return to union membership. Most relevant 
research has been limited to an examinatien of the union rate of return in 
terms of income, neglecting other socioeconomic indicators and instances of 
life chances. Concerning income, inconclusive research findings exist for 
variation in the union rate of return among worker subgroups defined by 
several indicators of socioeconomic status, including race, gender, skill level, 
economic sector, and occupation (Beck & Watson 1988, Coverman 1986, 
Form 1985:124, Freeman & Medoff 1984:49, Kalleberg et al 1981, Rexroat 
& Shehan 1986, Rosenfeld & Kalleberg 1390, Villemez & Bridges 1988). 
Inconclusive findings also exist for temporal changes in the union rate of 
return in income (D'Amico & Daymont 1932, Freeman & Medoff 1984:52— 
54). Quadagno's (1988b) historical compa-ison of women’s access to pen- 
sions in the automobile and garment indus-ries is one of the few studies of 
variation in the union rate of return in terms of fringe benefit coverage. 

The third issue concerning union effects on labor market outcomes is the 
impact of unionism on the degree of socioeconomic inequality. Studies of 
wage dispersion indicate the equalizing effects of unions on income inequality 
among firms, geographical areas, and sk ll levels (Form 1985:122, 129; 
Freeman & Medoff 1984:78—93; Pfeffer & Ross 1981). However, Wallace & 
Kalleberg's (1982) analysis of craft uniorism suggests that separate craft 
unions within the same firm or industry, often engaging in separate collective 
bargaining negotiations with the same employer(s), may actually preserve or 
promote occupational skill gaps in pay. 

Furthermore, the impact of unionism on socioeconomic inequality is com- 
plicated by the on-going efforts of many un.ons to rationalize with universal- 
istic criteria such as seniority what had been capricious managerial decision- 
making in labor allocation and compensation (Cornfield 1987b). Nonetheless, 
Beck (1980) argues, in the case of racial inequality in wages, that while 
some unions have opposed racial discrimination because it weakens worker 
solidarity, others, especially craft unions, have attempted to protect white 
workers’ wages by segregating and isolating blacks in the labor market. This 
suggests that some unions have embraced universalistic workplace rules 
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more fully than others and that, rather than reducing socioeconomic inequality 
among workers, unions may have substituted a universalistic basis of inequal- 
ity for more particularistic ones. 

Research on any universalizing effects of unionism on the determinants of 
socioeconomic inequality is inconclusive. The research indicating such uni- 
versalizing effects shows that inequality in layoff chances is more likely 
attributable to seniority among union workers than it is among nonunion 
workers (Cornfield 1987b); worker ability is a more important determinant of 
upward mobility chances for nonunion than for union workers (Sgrensen & 
Tuma 1981); unionization reduces the gender gap in health insurance cover- 
age (Perman & Stevens 1989); and education is a more important income 
determinant for nonunion workers than it is for union workers (Form 
1985:129, Pfeffer & Ross 1981). However, a few studies suggest that senior- 
ity is a more important income determinant for nonunion workers than for 
union workers (Pfeffer & Ross 1981), perhaps suggesting the existence of 
union threat effects on labor compensation practices in nonunion firms. 
Research on the impact of unionism on racial inequality in income is also 
inconclusive (Beck 1980, Form 1985:129, Pfeffer & Ross 1981). 

Our understanding of any universalizing effects of unionism on the deter- 
minants of socioeconomic inequality would greatly benefit from comparative 
studies of actual collective bargaining agreements and the personnel practices 
of nonunion companies. Such studies would illuminate any differences in 
union and management preferences for labor allocation and compensation 
criteria and indicate how these differences may contribute to institutional and 
direct discrimination by race, sex, and other nonuniversalistic criteria. 


Political Effects of the Labor Movement 


There are two separate bodies of research on labor's political action. First, 
pluralist research attempts to explain the political effectiveness of labor as an 
interest group. Second, research on the development of the welfare state has 
analyzed how labor, capital, and the state shape social policy. 


PLURALIST RESEARCH From a pluralist perspective, organized labor as a 
whole, as well as individual unions, are among many competing interest 
groups that attempt to influence the enactment, by an otherwise neutral state, 
of public policies consistent with their specific, interest group goals. In 
pluralist research, union efforts to shape public policy, like those of other 
interest groups, are constrained mainly by the capacity of unions to influence 
the behavior of voters and incumbent, elected public officials. In light of 
Masters & Delaney's (1987) comprehensive review of pluralist research on 
labor political action, I turn to suggestions for future pluralist research (also 
see Form 1985:211—214 and Freeman & Medoff 1984:191—206 for reviews). 
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Several pluralist issues about labor political action remain underresearched. 
The first is the origins of labor's legislative agenda. Cornfield (1989b) argues 
that union support for different types of putlic policy depends on membership 
socioeconomic status and membership growth rates. This analysis ought to be 
extended to several eras of labor history. Second, pluralist research has 
neglected analysis of union impact on politizal party platforms. Third, virtual- 
ly no research has addressed the impact of coalitions between unions and 
nonunion groups on union electoral and legislative success rates. Fourth, the 
degree to which union electoral and legislat ve successes result not from union 
“clout” but from union backing of “wirners” is unknown. Fifth, union 
electoral and legislative success rates have not been compared to those of any 
other interest group, including those of labor's antagonists. 


LABOR AND THE WELFARE STATE The impact of organized labor on the 
development of the US welfare state has been studied in conjunction with 
research on the nature of the capitalist state and, specifically, on the influence 
of the capitalist and labor classes, as well as the state itself, on the develop- 
ment of US social policy. Differences in the prevailing theories—i.e. the elite 
(Domhoff 1986-1987, 1987), neo-Marxiam (Levine 1988, Quadagno 19882), 
and state-centered (Orloff & Skocpol 1984) theories—focus mainly on the 
relative impact of the capitalist class (or class fractions) versus the state on 
social policy and are, therefore, beyond tie scope of this review. 

Regardless of theoretical perspective, raost research suggests that orga- 
nized labor was too weak to influence US social policy before and during the 
New Deal. According to Pavalko (1989), labor militancy had no impact on 
the timing of the enactment of state workmen's compensation laws during the 
1909-1929 era, perhaps because, as Orlof= (1988) implies, organized labor, 
despite its advocacy of workmen's compensation, was unable to enter a 
“cross-class coalition" with middle-class reformers during this era (also, see 
Orloff & Skocpol 1984 and Skocpol & Ikenberry 1983). Indeed, labor's 
involvement in a cross-class coalition strongly influenced the enactment of the 
Ohio old age pension law in 1933 (Quadagao & Meyer 1989). More research 
on variations in organized labor's impact on social policy in state and local 
government—-over time, geographically, aad by type of union (e.g. craft or 
industrial)—would illuminate how labor may have influenced pre-New Deal 
social policy. 

Recent research suggests that organized ‘abor had little or no impact on the 
enactment of the Social Security Act of 1935 (Domhoff 1986-1987, Quad- 
agno 19882, Skocopol & Ikenberry 1983). Jenkins & Brents (1989), howev- 
er, argue that protest and labor militancy stimulated the passage of the act 
(also, see Greenstone 1969). Amenta et al. (1987) have argued that state 
unemployment insurance laws, a component of the Social Security Act, were 
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influenced by organized labor, especially where labor was aligned with 
reformers and non-patronage oriented, cohesive state Democratic parties. 

Unresolved is the nature of any impact organized labor may have had on the 
National Labor Relations (Wagner) Act of 1935, which prohibited employer 
interference in, and established US government supervision of, labor union 
organizing. Neo-Marxian Levine (1988: ch. 6) suggests that rising labor 
militancy influenced the passage of this act. Historian Tomlins (1985), who 
claims his analysis is consistent with some neo-Marxian arguments (pp. 
xiii-xiv), shows that although the AFL was supportive of the bill, the AFL 
was unsuccessful in its effort to limit government regulation and to preserve 
union control of organizing by attempting to amend earlier versions of the bill 
(Tomlins 1985:132-140). For Domhoff (1987:180), the Wagner Act con- 
stitutes an "arrangement between the general leaders within the power elite, 
with the tacit support of Southern politicians, and the leaders of the unskilled, 
industrial worker segment of the working class," effectively assigning a 
subordinate influence on the passage of the act to only one segment of 
organized labor. Finally, organized labor is virtually irrelevant to Finegold & 
Skocpol's (1984) state-centered analysis of the Wagner Act. 

In sum, research is inconclusive about the role of labor in the development 
of the US welfare state before and during the New Deal. To discern this role, 
it is necessary to analyze: (a) the origins of labor's full legislative agenda, 
examining variations in public policy goals over time, geographically, and 
among unions (Cornfield 1989b); (b) the social, political, and economic 
conditions under which various branches of organized labor were more or less 
successful in influencing the enactment of diverse prolabor laws; and (c) the 
kinds of groups with which organized labor may have had to align in 
coalition, as well as the conditions conducive to coalitions, to influence the 
enactment of public policy successfully. 

Studies of the union impact on welfare state development since World War 
II have focused on changes in federal government spending levels, rather than 
on the enactment of law (for an exception, see Donnelly 1982). Most studies 
of federal government spending levels have been neo-Marxian, time-series 
analyses which argue that to ensure capital accumulation the state increases its 
expenditures—e.g. military and social welfare expenditures—to stimulate 
employment during eras of rising labor militancy or when the potential for 
such militancy increases along with increases in the union unemployment rate 
(e.g. Griffin et al 1982, 1983; Hicks 1984). In contrast to these time-series 
analyses, future time-series analyses should not assume a constant capacity of 
organized labor to influence federal spending, given the decline in the per- 
centage unionized over the post-World War II era. Also, given this decline, 
the measure of “union unemployment rate,” operationalized as the unemploy- 
ment rate for several broad industry classifications, probably reflects the 
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unemployment rate of mainly nonunion werkers, even if union workers are 
overrepresented in these broad industry classifications. 

Regarding social policy developments sirce World War II, few researchers 
have analyzed the three major legislative setbacks for organized labor: i.e. the 
Taft-Hartley Act of 1947 and related state rizht-to-work laws (Canak & Miller 
1990); the Landrum-Griffin Act of 1959; and the failure of the Labor Law 
Reform bill of 1977-1978 (Cornfield 1982b, Tomlins 1985). Virtually no 
research has examined labor's impact in such policy domains as civil rights, 
public education, tax, social welfare, and labor law. 


CONCLUSION 


If research on the causes and consequences of the labor movement is to 
contribute to a sociological theory of labor action, it ought to be redirected in 
three ways. First, labor organizations—e.g. federations and unions—should 
be considered institutional actors whose structures, goals, ideologies, and 
actions are deserving of scholarly attentiom. In contrast, much research has 
demonstrated links between unionism and socioeconomic and political out- 
comes but has failed to specify organizec labor's diverse motivations and 
means for realizing such outcomes. 

Second, labor organizations or "organized labor" should be treated as a 
diverse set of actors. At a minimum, differences between craft and industrial 
unions in ideology, structure, tactics, end social composition make a 
monolithic image of organized labor theoretically and empirically in- 
appropriate. 

Third, publicly accessible primary data on labor organizations as in- 
stitutional actors ought to be incorporated iato labor research. Unlike private 
corporations, which are often secretive about their internal operations, 
unions, as formally democratic membership organizations, produce primary 
data which is accessible in libraries and labor archives.* For example, analy- 
ses of actual collective bargaining agreements are surprisingly neglected in 
studies of the union impact on socioeconomic inequality. Other primary data 
sources include union constitutions, union newspapers, verbatim convention 
proceedings, and union executive council reports which often include 
membership and financial data. 

Redirecting research in these ways wilI place labor organizations in the 
center of labor research. With this emphasis, it will become increasingly 
feasible to theorize about the actions of ozganized labor. 


4See the Winter-Spring 1990 issue of Labor Histor for an exhaustive review of labor archives 
in the United States. 
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Abstract 


We review the literature on the economic consequences of marital dissolution 
for women. Longitudinal studies of the effects of divorce and widowhood 
indicate that both types of dissolutions have negative and prolonged con- 
sequences for women’s economic well-being. This is not the case for men, 
where marital dissolution often leads to an improved economic standard of 
living. Following an examination of empirical studies and measurement 
issues in the divorce and widowhood literatures, we describe preexisting and 
direct sources of women’s postdissolution economic insecurity. Women’s 
postdissolution economic hardship is due to multiple interrelated factors, 
often only superficially coupled with the marital dissolution event. In particu- 
lar, the division of labor during marriage, lower wages paid to women both 
during and after marriage, and the lack of adequate postdissolution transfers 


"This paper represents a truly shared effort of the two authors. The ordering of names is 
alphabetical and is not intended to indicate the greater effort by one. We gratefully acknowledge 
the advice of Sara McLanahan at several key stages of our writing. 
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to women imply that unless changes in women's work roles are mirrored by 
social policy initiatives and men's assumption of equal responsibility for 
children (both within and out of marriage), 2conomic prospects for previously 
married women will remain poor. 


OVERVIEW 


There is little dispute that women without hisbands are worse off economical- 
ly than their married peers or than they themselves were when still married. 
Households headed by women, whether by young mothers or elderly women 
(typically without children), are more likely to be poor than households 
headed by men of similar age. Numerous s-udies provide the statistics, which 
need not be repeated here (see McLanahan zt al 1989, Sawhill 1988). Rather, 
the purpose of this chapter is to review the literature that attempts to answer 
the question of why women whose mariages dissolve through death or 
divorce suffer a sharp decline in economé& status. 

It is important to start the discussion by recognizing that the standard 
measures of well-being treat all family members who live together in one 
household as an economic unit within wiich resources are assumed to be 
equally shared. By these assumptions wives cannot be worse off than their 
husbands—unequal market earnings (or otter income receipts) are assumed to 
convert into equally accessible income in the family unit. Similarly, children 
cannot be worse off than the parents with whom they reside. 

When marriages dissolve through divorce any inequality in the sharing of 
resources can be directly observed. Following divorce, it is assumed that no 
sharing occurs between separated parents o- between noncustodial parents and 
their children unless income transfers (or ezpenditure payments) are explicitly 
reported. Divorced status defines separate consumption units. 

Marital dissolution through death, on the other hand, leaves only one 
consumption unit. In widowhood, sharirg of income can extend beyond 
marriage only through a transfer to the s.irvivor—through insurance or in- 
heritance—of resources previously paid sclely to the deceased spouse. In the 
absence of such transfers, the economic satus of the widow is substantially 
lowered. 

The “story” that the marital dissolution literature seeks to explain is why 
equal sharing between marriage partners ceases upon marital dissolution. One 
may conclude that in divorce sharing ceas2s when “caring” does and that the 
spouse with the larger income attempts tc retain a disproportionate share of 
the former household’s total resources. Emotional alienation leads to sharp 
economic division. 

. A similarly facile explanation is not evailable, however, in the case of 
widowhood. Husbands are presumed to cere for their wives, and, in contrast 
to the situation in divorce, an active insurance market has developed to enable 
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husbands to ensure the continued well-being of the widow. Yet, as we show 
here, the widowhood literature finds declines in economic well-being for 
newly widowed women similar in magnitude to those for women who become 
divorced. 

What are the common threads that bind these women to a common experi- 
ence as their marriages dissolve? In this chapter we first review the literature 
on divorce and next on widowhood; we reflect on what conclusions drawn 
from each tell us about the economic position and power of women as well as 
about the role of marriage in reducing or increasing their economic vulnerabil- 
ity. 


EFFECTS OF SEPARATION AND DIVORCE ON 
WOMEN'S ECONOMIC WELL-BEING 


Numerous longitudinal studies concur that separation and divorce have detri- 
mental economic consequences for women. These studies track economic 
well-being through time, comparing predivorce to postdivorce economic 
status. Three central themes emerge from this literature. First, despite a wide 
range of estimates across studies and subgroups of women, there is remark- 
able consensus that divorce is associated. with economic decline for women. 
Second, gender differentials are marked, with men generally better off 
following separation and divorce, relative both to women and to their pre- 
disruption standard of living. Third, unless women remarry, the economic 
deterioration they experience is likely to be prolonged. 


Short-Term Effects 


Longitudinal studies typically employ one of the following measures to 
document the economic consequences of separation and divorce: poverty 
rates, family income, per capita income, and the ratio of income-to-needs. 
Poverty rates among women in the year following separation or divorce are 
uniformly higher than during marriage (Nestel et al 1983, Duncan & Hoffman 
1985, Morgan 1989). Precise estimates vary among studies, and they are 
higher for subgroups with higher predisruption rates. For example, Duncan & 
Hoffman estimate that only 4% of white women who had above-median 
family incomes while married are poor in the postdisruption year. But poverty 
rates are substantially higher for women with higher predisruption rates; one 
third of black women and one fifth of white women with below-median 
incomes during marriage are poor in the year following marital disruption. 
Nestel et al (1983) estimate that postdisruption poverty rates range from 25 to 
4596, depending on race and whether the woman is separated or divorced. 

Though an important indicator of deprivation, the poverty rate focuses on a 
single point in the income distribution; it is silent on the magnitude of 
economic change and insensitive to income change that remains above or 
below the poverty threshold. Table 1 summarizes the results of studies that 
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estimate change in at least one of the cther three measures: total family 
income, per capita income, and the income-to-needs ratio. 

Panel A shows estimates from seven studies of change in total family 
income surrounding the time of separaton or divorce. All indicate that 
women undergo marked declines in famil; income. Three studies suggest a 
30% decline (Hoffman 1977, Mott & Moore 1978, Duncan & Hoffman 
1985), and four report somewhat higher declines, with the average being 40 to 
50% (Corcoran 1979, Nestel et al 1983, Weiss 1984, Weitzman 1985). 
Where a range of estimates is reported, th s reflects variation among catego- 
ries of women.! The steepest declines in fanily income occur for women with 
the highest family incomes during marrizge, and for black women. Weiss 
estimates that women who were in the tcp third of the income distribution 
prior to disruption experience a 55% drop ia income, while those in the lowest 
third of the income distribution experience a 23% decline. Similarly, Weitz- 
man reports that women with the highest pr=disruption family incomes experi- 
ence a 71% decline in family income compared to a 29% decline amone those 
with the lowest predisruption incomes. When Duncan & Hoffman's data are 
disaggregated by race, black women evidence even sharper income declines 
than do white women who had been relatively well-off during marriage; black 
women's incomes are reduced by almost one half. 

Panel B presents estimates of change in per capita income, a measure that 
implies no returns to scale with additional family members. Reductions in per 
capita income among women following separation and divorce are generally 
in the range of 20 to 35% (Mott & Moore 1978, Nestel et al 1983). Weitz- 
man's (1985) estimates range from +16% to —58% with most women 
experiencing declines between 20 and 50%. The most dramatic declines occur 
for women who had been in marriages of long duration and who had higher 
income prior to divorce. 

Estimates of change in income-to-needs are shown in Panel C. Needs 
standards take into account returns to scal= associated with larger household 
size and, typically, the different consumotion needs of younger and older 
individuals. Four studies use the official poverty thresholds to measure needs 
(Hoffman 1977, Corcoran 1979, Duncan & Hoffman 1985, Stirling 1989), 
while Weitzman (1985) uses the moderately higher Lower Standard Budget of 
the Bureau of Labor Statistics.? 


IStudies variously present overall averages, and/or averages by race, predisruption income 
status, separated vs divorced status, duration of marrizge, or some combination of characteristics. 

?Other equivalency scales might lead to differentresults. Lazear & Michael (1980) estimate 
larger returns to scale, implying that the poverty scale underestimates needs of one- versus 
two-person households. The budget share index estimated by van der Gaag & Smolensky (1982) 
estimates more modest returns, implying an overesimate by the poverty scale of the relative 
needs of one- versus two-person households. See S3rensen (19912) for an examination of the 
` effects of different needs standards on estimates of the consequences of divorce. 
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Estimates of declines in income-to-needs experienced by women following 
separation or divorce range from a low of —6.7% (Hoffman 1977) to a high of 
—73% (Weitzman 1985). Corcoran (1979) and Stirling (1989) report declines 
in income-to-needs of approximately 20 to 30%. While Duncan & Hoffman 
report an overall decrease of only 13%, they show considerable variance 
among women, with both black women and white women who were relatively 
well-off during marriage experiencing a decline in income-to-needs of 20 to 
30%. 


Evidence of Gender Differentials: The Unequal Costs of 
Divorce 


There is no simple answer to the question of how much decline in economic 
status women will experience following separation and divorce. Some decline 
in economic well-being is expected when one household becomes two simply 
due to losses in economies of scale. What is problematic is the overwhelming 
evidence, which we review below, that men—unlike women—experience an 
increase in economic well-being upon divorce. In theory, a disparity in 
outcomes between divorced men and women could be explained merely by 
gender inequalities in earning power. However, to the extent that children are 
involved in a separation and divorce, it is more difficult to explain this 
disparity. In the case of divorce, children have two living parents that 
presumably care about their economic welfare. We might therefore expect 
that the standard of living in postdivorce households with children would 
more often reflect that of the nonresidential parent, who is generally the 
father. l 

Burkhauser & Duncan (1989) provide a vivid illustration of the gender 
differential in the costs of marital disruption. Using data from the Panel Study 
of Income Dynamics (PSID), they estimate that a quarter of women ages 
26—35 who separate or divorce will experience a drop in the ratio of income- 
to-needs of at least 50% which will bring their final income-to-needs ratio to 
1.5 or less (no more than 50% above the poverty threshold). Among men, the 
analogous risk is 496 (see also David & Flory 1988). Three of the studies in 
Table 1 analyze changes in the economic well-being of men upon separation 
or divorce. Like women's, men's family incomes decline due to the loss of 
earnings of their ex-spouses, but not by as much as it does for women. 
Hoffman's (1977) estimates indicate a 19% reduction, while Duncan & 
Hoffman (1985) report an average decline of just 796.? In fact, men's incomes 
rebound to slightly above their predisruption level by the second year after 
divorce. Like women also, men who had relatively high family incomes while 
married experience greater declines (Duncan & Hoffman 1985, Weitzman 
1985). 


?Duncan & Hoffman's estimates reported in Table 1 are those for women who do not remarry. 
Estimates for men include the experiences of remarried men. 
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Available evidence indicates that, unHke women's, men's standard of 
living improves following separation or Civorce. Hoffman (1977) estimates 
that the income-to-needs ratio of separated and divorced men increases by 
16.5%, even after deductions for alimony and child support payments. Dun- 
can & Hoffman (1985) report an increase cf 1396 in income-to-needs between 
the year before marital disruption and the year after disruption, with all 
subgroups of men in their sample experiencing an improvement. By the 
second year after disruption, men's average income-to-needs rise to 2446 
above predivorce levels. Weitzman's (1985) data suggest that, even after 
deductions for child support and alimonz awards, men experience a 42% 
improvement in income-to-needs. 

How can this evidence be reconciled with the economic hardship that 
women experience after divorce? Rathe- than promoting equality in the 
postdivorce standard of living between men and women, children appear key 
to this inequality; more precisely, the differential responsibility for children 
undertaken by men and women is crucia. First, few men assume physical 
custody of their children following separation or divorce. This translates into 
a substantial decrease in "needs" for fathers. Second, only a minority of men 
fully comply with child support awards, end, even when they do, payments 
are meager (US Bureau of the Census 1989). Third, husbands tend to earn 
more than wives (Sorensen & McLanahan 1987, Treas 1987), which is related 
to gender inequities in nonmarket work and childcare both during marriage 
and after divorce. The upshot is that men retain a larger share of the family's 
income, while their economic needs decrease more than their income. 


Longer-Term Effects of Divorce: Do Women Recover? 


If declines in women’s economic status ere short-lived, concerns about the 
effects of marital disruption might be overstated. Some recovery might be 
expected over time for mothers as children age, freeing up time for market 
work. Unfortunately, direct evidence regarding this possibility is scarce; the 
longitudinal studies generally trace economic status for just the first five years 
following separation and divorce and, es we have seen, they categorize 
women only by race and predivorce ecomomic status. Some insight comes 
from Peterson (1989) and Duncan & Hoffman (1985). Peterson finds evi- 
dence that women divorced at least 10 yzars have significant earnings and 
income advantages over recently divorced women. Since the number of 
children in the household has a negative effect on the labor supply of the 
women in his sample, this is consistent with the notion that any long-term 
improvement in economic well-being among divorced women might be re- 
lated to the maturing and departure of chilcren from the household. Duncan & 
Hoffman include a sample of children of civorcing parents in their study. On 
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average, there is no improvement in children's income-to-needs ratios in the 
first five years following marital disruption. 

Most longitudinal studies indicate that women's economic vulnerability is 
prolonged for at least five years, unless remarriage occurs, and despite 
dramatic increases in labor force participation and hours worked upon separa- 
tion and divorce (Hoffman 1977, Mott & Moore 1978, Corcoran 1979, 
Espenshade 1979, Moore 1979, Weiss 1984, Duncan & Hoffman 1985, 
Peterson 1989, Stirling 1989). Earnings constitute the primary source of 
income for most separated and divorced women; while some subgroups of 
women manage to increase their incomes slightly over time, this does not 
bring them near predisruption income-to-needs levels. Weiss (1984) reports 
little increase in women’s family income between the first and fifth year 
following marital disruption, although there is some variation by income 
status before disruption. For women who had been in the top third of the 
income distribution while married, the postbreak/prebreak income ratio is .45 
in the first year following disruption, increasing to .50 by the fifth year after 
disruption. For those in the middle income category, the ratio is .56 in the first 
postdisruption year, rising to .64 by the fifth year. Lower income women 
experience no income improvement in the five year period. Duncan & 
Hoffman's (1985) study similarly attests to the persistence of income de- 
clines; over the five years after disruption, women's average income increases 
only from 70 to 71% of predisruption family income. Again, differences 
across subgroups are marked; black women's postdisruption to predisruption 
income increases from 54 to 74%. Some recovery is also apparent for white 
women who were above the median income level prior to disruption, but 
white women with below-median incomes before disruption suffer further 
income deterioration. 

Taking account of changes in consumption needs over time, Stirling (1989) 
reports no improvement in income-to-needs during the five years following 
marital disruption. Duncan & Hoffman (1985), who report a relatively Jow 
fall in this ratio following divorce, report only a moderate increase in income- 
to-needs, from .87 to .94 of predisruption levels during the first five years 
after divorce. 

Studies agree that remarriage is the most likely route to economic recovery 
(Nestel et al 1983, Duncan & Hoffman 1985, Peterson 1989). Duncan & 
Hoffman estimate that among women who do not remarry income remains at 
30% below its predisruption level for at least five years following separation 
or divorce. But among women who do remarry, family income is 27% higher 
than predisruption income in the fifth postdivorce year, comparing favorably 
to the income growth experienced by couples married throughout the period. 
Similarly, among remarried women income-to-needs by the fifth postdivorce 
year is 2596 higher than predisruption levels. Since remarriage rates have been 
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declining since the late 1970s (Sweet & Bumpass 1987), and there are marked 
differentials, particularly by race and age; in the likelihood of remarriage, 
remarriage cannot be viewed as a general means to economic recovery. 


EFFECTS OF WIDOWHOOD ON WOMEN'S ECONOMIC 
WELL-BEING 


Despite changes over the past two decades in pension and social security 
policy that have sought specifically to improve the economic security of 
widows, longitudinal studies show that the 2conomic consequences of widow- 
hood are no less detrimental to women than are those of divorce and separat- 
ion. The studies reviewed here are those that use panel daia to follow 
individual women from being married into widowhood. We ignore the larger 
share of the literature on widows that infer the consequences of marital status 
change from analyses of cross-sectional or time-series data. The major con- 
clusions from the longitudinal series are, fi-st, death (widowhood) is selective 
of relatively low income men (wives); second, the event of widowhood is 
associated, nevertheless, with a major ckange in economic well-being for 
women; third, husbands fail to adequately insure their wives against the 
economic consequences of widowhood; and fourth, gender differences are 
evident as men suffer little or no declime in economic well-being when 
widowed. 


Short-Term Effects 


Measures of economic change when women are widowed typically compare 
poverty status or income-to-needs during a-year early in widowhood to that in 
the last year the husband was alive. The “skort-term” period depends upon the 
inter-survey period for the panel, which is two years in the Retirement History 
Study (RHS), one year in the Panel Study of Income Dynamics (PSID), and 
as short as four months in the Survey of ncome and Program Participation 
(SIPP). 


Using selected years of data from the RHS, Hurd & Wise (1989) report an 
increase in poverty rates from 9 to 42% es women are widowed. Including 
data from all RHS interviews (taken biaanually between 1969 and 1979) 
Holden et al (1986) report a similar rise im poverty rates from 9-16% in the 
last year of marriage, to 38-50% in the first year of widowhood. While 
Burkhauser et al (1986) show that poverty during the first year of widowhood 
may have been overestimated in both studies, even after appropriate income 
adjustments, poverty among this group ef women is estimated to almost 
double, from 14% to 26% (Holden 1989). 
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Because poverty among women already poor when married cannot by 
definition be associated with the event of widowhood, Zick & Smith (1986) 
analyze the income path for widows in the PSID who were not poor just prior 
to widowhood; 17% entered poverty in widowhood, a percentage that is 
consistent with data reported for initially nonpoor widows by Holden et al 
(1988). Among women, those whose husbands died during the 1984 SIPP 
panel, Holden (19892) finds that poverty rates rose from just 6% when 
married to 1596 immediately after the husband's deaths. 

A substantial fall in income is a hazard faced not only by the poor. 
Examining that risk across income groups shows that greater prewidowed 
income protects against poverty (which can be entered only with a large 
absolute income decline), but that higher income women are more likely to 
suffer a substantial decline in income-to-needs when widowed (Holden 
1990b, Burkhauser et al forthcoming, Smith & Zick 1986). Holden (1989b) 
finds that women who eventually became widows in the RHS (widowed 
between 1969 and 1979) suffered a decline in mean income-to-needs by one 
third. Almost identical estimates are reported by Zick et al 1991, (forthcom- 
ing) for PSID women widowed between 1968 and 1984 and Holden (1990b) 
for women widowed during the 32-month survey period of Survey of Income 
and Program Participation. 

Measured changes in average poverty rates and income mix the experience 
of women widowed at different ages. Consider the lifetime income profile of 
a hypothetical couple. Prior to retirement the husband (and perhaps the wife) 
work. At age Ro the worker(s) may phase into retirement, and at age R, all 
paid work ceases. Both spouses will then receive social security benefits and, 
perhaps, pension income. Life tables peg the husband's expected death at age 
D,, after which the wife receives her own survivor benefits (or retired-worker 
benefits, if higher) from social security and, perhaps, a 50% survivor benefit 
from the husband's pension. 

When the husband dies at age D,, the widow is insured through social 
security and private pensions against the loss of her husband's retirement 
income. A woman widowed prior to her husband's retirement is subject to the 
same system——survivor benefits to her will be the same share of the retirement 
benefits for which her husband's earnings history would entitle him. Unless 
insured through other means, younger women are likely to suffer a larger 
absolute and percentage decline in income—the combined effect of a fall to 
"retirement" income and the fall to survivor benefits. 

The influence of age has been an important variable in descriptive analyses 
of widowhood. Hurd & Wise (1989) found no relationship between husband's 
age at death (which could only be 59-71 in the RHS) and the chances of a 
widow's being poor. Others find, however, that age of wife at widowhood 
does matter, with those widowed before age 60 more likely to become poor 
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(Holden et al 1988, Burkhauser et al fortkcoming, Smith & Zick 1986, Zick 
& Smith 1986)—evidence of the protection from age-conditioned insurance 
(in which it is the age of the survivor thet matters) and of the labor market 
constraints faced by younger widows with children. Holden (1990b) and Zick 
et al (1991—forthcoming) find that while older and younger widows experi- 
ence much the same change in income-to-needs, the former suffers a loss in 
husband's retirement income (which megy be insured through age-related 
social and private insurance) while the latter suffers a loss in his earnings 
which may be insured only through private life insurance and through social 
security if she has a minor child. Thougr based on SIPP's small sample of 
widows, Holden's (1990b) results imply that it is young widows without 
children—and therefore unprotected by social security—whose short-term 
income fall may be most severe. 


The Unequal Costs of Widowhooa: Evidence of Gender 
Differentials 


Judgments about the “seriousness” or public policy implications of income 
change observed among widows, requires some sense of what should happen. 
In contrast to divorce no decline in well-being need be expected; after 
widowhood there is still only one household and that household is smaller. 
Thus, if all income previously paid to the couple converts to the survivor—a 
possibility since the deceased husband need not and cannot, literally, take it 
with him—measures of economic well-being that consider both income and 
needs would show a gain in the income status of the household. But as 
described above, that is not the case. On average widows are left with much 
less income than is necessary to maintain the same standard of living enjoyed 
when married. Comparisons with otherwise similar couples in which no death 
occurs show that these declines are not the result of more general economic 
conditions. 

Studies that compare the experience of widows and widowers show that the 
economic experience of the widowers differs from that of widows. One of the 
earliest studies, by Zick & Smith (1986), reported widowed men as equally 
likely to suffer a decline in status as were widows, but the number of men in 
their sample (27) made this conclusion tentative at best. A larger sample of 
widowers in Zick et al (forthcoming) sho.v that older widowers suffer a far 
smaller decline in income, primarily from- the loss of wives’ social security 
benefits. Burkhauser et al (1991—forthcoming) and Holden (1990a) show 
that while the needs-adjusted income of women falls sharply when their 
husbands die, men’s incomes remain stable upon the deaths of their wives. 

Women suffer a large fall in income when their husbands die for largely the 
same reasons women suffer upon divorce Similarly, the disparate effect on 
women and men of divorce and widowhood have like explanations. It is 
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useful at this point to discuss why this should be the case even though divorce 
and spousal death appear to be quite different phenomena. 

The relative well-being of divorced wives and husbands or of widows and 
widowers is closely linked to economic independence of the individual 
spouses and the mechanisms developed by society to facilitate or compel the 
sharing of resources beyond the married state. As is true of divorced women, 
widows lose direct access (previously obtained through shared living) to any 
income that was previously (and, in divorce, continues to be) paid solely to 
the husband. Consider; for example, two couples with identical earnings 
contributed by the husband. At the time one couple separates into two houses 
and the husband of the other dies, the female-headed households by definition 
no longer have direct access to the (ex- or deceased) husband's earnings. 

The divorced wife may be able to negotiate some sharing (perhaps by court 
order) in her ex-husband's earnings, while for the widowed woman no such 
initiative after marital dissolution is possible. On the other hand, the widow 
may be shielded by private or social insurance against the income con- 
sequences of her husband's death; the absence of equivalent "divorce" in- 
surance removes that source of predissolution-determined security for the 
divorced woman. If eventually divorced or widowed couples are no different 
with respect to the control of income when married, and if husbands fail to 
protect their wives against the risk of marital dissolution, observed effects of 
widowhood and divorce will be similar. 

In the United States, social security provides almost universal insurance 
against the economic consequences of widowhood. Survivor benefits are paid 
either if the widow has a child under the age of 16, or is 60 or older (50 if 
disabled). Younger widows without young children receive no protection. A 
second leg of protection for widows is the pensions in which their husbands 
may have been vested. Since the passage of the 1974 Employee Retirement 
Income Security Act, pensions must offer a joint and survivor benefit (that is, 
a pension some share of which continues to the survivor) as the default form. 
Nevertheless, pension income may not continue to the widow-—husbands may 
choose a single life pension (although under the 1984 Retirement Equity Act 
the wife must concur), pension plans may offer lump sum payments as the 
primary payment form (thereby by-passing the ERISA requirement), and 
pensions may deny paynient of survivor benefits to the widow until that time 
the deceased husband would have reached age of eligibility for retirement 
benefits. Holden (1990b) recently has argued that this last provision plays a 
role in the lack of pension benefits paid to young widows whose husbands had 
reported being vested prior to death. In short, the child- and age-related 
requirements imposed by social security and private pensions on the receipt of 
survivor benefits may result in substantial numbers of women being left 
unprotected against the short-term economic consequences of widowhood. 
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Auerbach & Kotlikoff (1987) show that men tend to severely underinsure 
themselves (through the purchase of life insurance) thus failing to protect their 
wives against the consequences of being widowed. 


Longer-Term Effects: Do Widows Recover? 


Unfortunately, available data have not permitted examination of the con- 
sequences of widowhood for many years iato widowhood. The RHS follows 
recent widows for a maximum of eight vears (1971-1979), and the PSID 
covers women widowed beginning in 1970. Even then, widows continuing in 
the sample for the longer period are too few to draw conclusions about very 
lengthy widowhood. Restricting samples to women with more than one year 
of marriage further limits the length of the widowed period. Hence most 
conclusions about “long-term” effects are based on changes in income after 
the first year or few months (in SIPP) of widowhood. 

Young widows can be expected to recover over time from observed sharp 
falls in income as they reenter the work force or become age-eligible for 
survivor benefits from social security or private pensions. It is generally 
assumed that the possibility of recovery for older women is small. If already 
pension recipients, they are unlikely to see real increases in survivor benefits 
(Clark & Sumner, 1985), remarriage is a small probability (Zick et al 1991— 
forthcoming), and the chances of successfully entering the work force di- 
minish with age. 

Those who become poor do not necesszrily remain so; about a third who 
enter poverty later exit it (Bound et al 1982, Coe 1988, Smith & Zick 1986, 
Burkhauser et al 1986). Limited evidence suggests, however, that some of 
these exits are themselves temporary (Burkaauser et al 1986) and that after the 
initial shock of widowhood, the marginal changes of entering poverty do not 
diminish (Holden et al 1988). 

Moving out of poverty is a limited measure of "recovery" since some 
women, plunged into poverty by a large fail in income, may exit after only a 
small increase in income. Zick et al (1991—forthcoming) conclude that while 
some recovery in the income-to-needs ratie takes place after widowhood, five 
years into widowhood the ratio remains well below that when married. 
Holden (1990b) finds and reports recovery among younger widows after the 
first months of widowhood, reflecting perhaps delayed payments of survivor 
benefits, but income-to-needs does not recover to prewidowhood levels. Like 
divorce, widowhood exacts a permanent cost on widows not observed when 
men lose their spouses. In contrast to divorce, increased earnings are not 
recorded among young or older widows; in fact, Holden (1990b) as well as 
Zick et al (1991—forthcoming) report declines in women's total earnings 
following their husbands' deaths. 
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MEASUREMENT ISSUES 


Despite the literature's accord that widowhood and divorce have negative 
economic consequences for women, there is little consensus about their 
precise magnitude and causes. We outline below key factors that account for 
the diversity of specific estimates and uncertainties about its causes. 

First, studies of divorce use different data. Weitzman's (1985) sample is of 
men and women obtaining legal divorces in 1977 in Los Angeles County 
(CA). Since this sample is weighted toward couples with higher incomes, 
Weitzman's estimates of economic decline are the largest. Other studies use 
nationally representative samples, but these vary in coverage. For example, 
Mott & Moore (1978) use the National Longitudinal Survey (NLS) of young 
women (ages 14—24 in 1968) while Nestel et al (1983) use the NLS's mature 
women sample (ages 30-44 in 1967). We might expect older cohorts of 
divorced women to experience more severe falls in economic well-being, due, 
in part, to lower levels of labor force participation. Indeed, Nestel et al 
estimate larger declines upon divorce for most subgroups than do Mott & 
Moore. The PSID, a national sample of families, is a more representatively 
aged sample and thus records the experience of all age groups. 

Because the RHS and PSID are the basis for most longitudinal studies of 
widowhood, fairly uniform findings are reported. But diversity in sample 
size and age distribution suggests care in interpreting empirical findings. The 
RHS is a sample of widows of men within a select age range who died fairly 
young (between 1969 and 1979 when all would have been 68-73). The PSID 
is a more representatively aged sample.” However, the RHS is far larger, 
initially interviewing over 8000 married men, and recording more than 1200 
widowed events. PSID records about half that many events over a longer 
period. 

Second, studies employ different sample selection criteria. Some impose 
restrictions on age (Corcoran 1979, Duncan & Hoffman 1985), on duration of 
marriage and time spent separated or divorced (Stirling 1989), parenthood 
status (Weiss 1984), and, commonly, the period during which a marriage or 
marital disruption must be observed. Additionally, definitions of marital 
disruption differ. Most studies define both separation and divorce as in- 
dicators of marital disruption, but, as noted above, Weitzman considers only 
legal divorces; excluding separated women may overestimate levels of postdi- 
vorce economic well-being, since women who remain separated are often 
unable to afford a divorce because they are least well-off just prior to and after 


^An exception is Morgan (1981) whose study of NLS Mature Women reports little change in 
economic status. 
5RHS widows can be of any age; only married men were selected by age. 
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separation (e.g. Peterson 1989). Stirling (1989) includes women who report 
themselves as separated only if the separation culminates in divorce. The most 
inclusive definition is Duncan & Hoffman's (1985), who define cohabiting 
unions as marriages at risk of disruption. Inclusion of this group, least likely 
to have children and shared resources, may account in part for their estimates 
of relatively small income-to-needs change. 

Third, estimates are sensitive to the azcounting period over which: eco- 
nomic status is tracked. The preferred choice is to compare economic status in 
the year following marital disruption to th» year prior to disruption; this pegs 
estimates to the year of disruption for all cases. Studies using the NLS or RHS 
face restrictions in this regard since interviews were not conducted every year 
(Mott & Moore 1978, Nestel et al 1983). A variant with significant ramifica- 
tions is the measurement of change over a fixed time of several years (e.g. 
Hoffman 1977, Corcoran 1979). Separaticn or divorce may occur at any tíme 
after the start and before the close of the interval. Intra-sample heterogeneity 
in the length of the pre- to postdisruption interval precludes precise descrip- 
tion of the linkage between marital and economic change. 

Fourth, some studies fail to recognize an error that may arise if the first 
interview of widowhood or separation/d_vorce is shortly after the marital 
disruption (Burkhauser et al 1986). Consider a divorce or spousal death first 
recorded at the time of interview during tae first months of year T, with the 
income reference period being the previous year. Some respondents will have 
been divorced or widowed the entire (or most of the) reference year, but, for 
others, marital dissolution occurred some months later. The standard pro- 
cedure is to measure income during a reference period (e.g. previous year) of 
current household members, ignoring income contributed by the absent (or 
deceased) spouse when still a member of the household. The NLS and RHS 
follow this procedure, while the PSID combines the income received by the 
household prior to and after disruption in that year. SIPP provides income of 
every household member present in each month except for very recently 
deceased members. Reluctant to ask be-eaved persons about the recently 
deceased, the Census Bureau imputes fheir income on the basis of de- 
mographically similar living individuals. Holden (19892) shows that these 
imputations result in numerous false poverty transitions during that period. 

Including the ex-spouse's income in the income reference year of the first 
divorced/widowed interview, however, will overstate postdisruption income 
because during part of the year the couple was together. On the other hand, 
estimates that count only the respondent's income in that year will understate 
actual economic well-being for that year (because other sources of income 
were available for part of it). The most eppropriate comparison is, thus, of 
economic status during the first full income reference period spent widowed 
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or divorced to a period preceding marital disruption (e.g. Burkhauser et al 
forthcoming, Weiss 1984, Duncan & Hoffman 1985, Stirling 1989). This will 
of course lengthen the pre- to post-dissolution period for some observations 
and compound problems associated with time-dependent attrition. 

Finally, estimates of the consequences of divorce and widowhood are 
complicated by selective survival, that is, attrition from the previously mar- 
ried sample through remarriage and death which are conditioned in part by the 
economic variables of interest (Smith: 1989). Studies of marital disruption 
vary in their treatment of those who die or remarry during the panel study. 
Sample selection sometimes excludes women who die soon or remarry rapidly 
by stipulating that the respondent be separated, divorced, or widowed in two 
consecutive survey periods (Nestel et al 1983, Weiss 1984). Stirling (1989) 
deliberately limits her sample to long-term divorced women—those divorced 
for at least five years. Hoffman (1977) and Corcoran (1979) restrict their 
samples to those not remarried at the end of the accounting period, which may 
be several years after marital disruption. In a similar fashion, Coe (1988) 
restricts his sample of PSID women to those with at least three years of 
postdissolution data. All such restrictions disregard the postdisruption experi- 
ences of women that remarry or die fairly quickly, and estimates of change 
will not reflect the experience of all women whose marriages disrupt.® 

A common strategy is to allow cases to exit from the sample at the time of 
remarriage (Mott & Moore 1978, Weiss 1984, Duncan & Hoffman 1985, 
Burkhauser et al 1991—forthcoming, Holden 1990b). Again, note that es- 
timates must be interpreted cautiously since economic well-being through 
time pertains to the average status of a changing group with results reflecting 
the greater weight given to those that remain unmarried at successive dura- 
tions. Duncan & Hoffman thus present two sets of estimates, one for all 
women—including those who remarry—and one for those who remain un- 
married.” 


$How this affects estimates of the level or change in economic well-being following divorce is 
unclear. Divorced women who have relatively low labor supply and earnings are more likely to 
remarry (Peterson 1989). Duncan & Hoffman (1985) suggest those who remarry "gain" the most 
economically by remarriage. Further, black women are less likely to remarry than white women, 
and remarriage declines with increasing age (e.g. Sweet & Bumpass 1987). Because relatively 
prosperous widows are more likely to remarry and men remarry more often than women, 
exclusion of remarried individuals will reduce the observed difference between widows and 
widowers. This bias may be offset somewhat by the higher mortality of poorer widowers. 

"Sample selection bias becomes an important issue if we are interested in explanation rather 
than just description. To the extent that dissolution, death, and remarriage are associated with 
economic characteristics of individuals, estimates of the economic consequences of the dissolu- 
tion itself will be affected (Glass et al 1983, Peterson 1989). 
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CONTRIBUTING FACTORS 


Studies of the economic consequences of divorce and widowhood are largely 
descriptive. This is not surprising; the literature is still grappling with the 
complex nature of the association between marital status change and women's 
economic insecurity. This complexity ar&es because association may not 
reflect causality, and it has been difficult t sort out the influence of marital 
dissolution itself from other pre- and post-cissolution influences. Further, the 
set of apparently crucial determinants varies among subgroups of women and 
makes it difficult to develop a comprehersive theoretical framework to ex- 
plain how families prepare for and react to separation/divorce and widowhood 
and the consequences for them when those two events take place. Finally, 
measurement of some key variables that zre expected to influence the eco- 
nomic consequences of marital dissolution is difficult either because of their 
time dimension (e.g. influence of parents; or qualitative nature ("socializa- 
tion" in school). Below we categorize facters associated with postdissolution 
outcomes into: (a) factors existing prior to the death or divorce; and (b) factors 
directly associated with the event of mar-tal dissolution and the postevent 
period. 


Preexisting Factors 


One explanation for the observed economic insecurity of separated, divorced, 
and widowed women is that marital disrup=ion is selective of those relatively 
less well-off to begin with. Marital instability and death are inversely corre- 
lated with indicators of socioeconomic statis such as educational attainment, 
income, and husband's employment stability (Ross & Sawhill 1975, Hoffman 
& Holmes 1976, Becker et al 1977, Mott & Moore 1978, Espenshade 1979, 
Martin & Bumpass 1989). While women about to be divorced are more likely 
to be working and have relatively high earnings compared to their husbands, 
divorce and widowhood are selective of women with low education and 
earnings prospects (Moore 1979, Hurd & Wise 1989).* Income and wealth of 
couples who eventually separate or who are widow(er)ed is lower at the start 
of the observation period than is the income of those that remain continuously 
married (Hoffman 1977, Mott & Moore 19278, Holden et al 1986, Zick et al 
1991—forthcoming). 

While initial differences in socioeconorric status and earnings capacity are 
an important reason for the poverty of seme women following divorce or 
widowhood, they have two central limitatzons as an explanation of observed 


3Studies show that higher earnings (through better health care and more “healthy” lifestyles) 
reduce the risk of illness and morbidity (Anderson z Burkhauser 1985). 
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changes in economic status.? First, death and divorce occur throughout the 
socioeconomic spectrum, with some of the largest economic declines occur- 
ring to women who were well-off prior to disruption. Also, in contrast to 
divorce, death must eventually strike virtually all couples, and even when 
unexpected and apparently unrelated to economic status, must exact a severe 
economic cost to the survivor. Second, emphasis on predissolution economic 
status does not account for the gender differentials that emerge across the 
income spectrum in postdisruption ecónomic status (Duncan & Hoffman 
1985, Weitzman 1985). 

A second factor is the preexisting economic vulnerability of women in 
general relative to men, owing to the disadvantages women face in the labor 
market regardless of marital status. The most obvious indicator of this is the 
ratio of female to male wages, recently estimated at .65 among full-time, 
year-round workers aged 16 and over, although for younger cohorts that gap 
may be narrowing (Smith & Ward 1984, Bianchi & Spain 1986, Blau & 
Ferber 1986, Marini 1989). Two models have been offered to explain this 
wage differential. Human capital theorists attribute the lower education, 
training, and work experience of women to individual choices made on the 
basis of anticipated or actual family responsibilities. A second model gives 
greater emphasis to labor market discrimination that reduces the returns to 
education and training, and limits job options and work experience for 
women. The point here is not to evaluate the relative contribution of various 
factors to the wage gap women face prior to, during, or after marriage (see 
e.g. Corcoran & Duncan 1979, Mincer & Ofek 1982; also see Marini 1989 for 
a review of the wage gap and its sources). The crucial point is that whatever is 
the source of their lower earnings potential, when thrust into the role of 
primary earner, formerly married women are unlikely to match the earnings of 
the deceased or former husbands. 

The ability to successfully combine continued market work with marriage 
and parenthood is a strong predictor of a formerly married woman's chance of 
maintaining her economic well-being beyond marriage. Unfortunately, very 
few studies have related the effects of work and family decisions during 
marriage to postdissolution well-being. An important exception is Peterson 
(1989) who finds that women who did little market work during marriage 
experience significantly more economic hardship after divorce compared to 
women with relatively continuous or even intermittent work experience. 


?For lower income, minority women economic vulnerability is not catalyzed by marital 
disruption. Wilson (1987) sees the lack of jobs paying adequate wages to black males as a cause 
of marital instability and the decline in marriage among blacks (see also Malvaux 1985). 
Empirical evidence supports this position. Bane (1986) shows that a majority of black single 
mothers were poor before becoming single parents, although this was not true for white women. 
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Divorced “homemakers” were more likely te receive public assistance, enter 
poverty, and have a lower standard of living after divorce. Likewise, women 
who work prior to widowhood are less likely to enter poverty or suffer a sharp 
decline in income-to-needs when widowed (Holden at al 1988, Zick et al 
1991, forthcoming). 

How does being married and a parent (versus general social or market 
forces affecting all women) influence earnings potential? That they do is 
shown by the advantaged earnings and attainment positions of never-married, 
childless women (see Treiman & Terrell 1975, Hudis 1976, Hanson 1983, 
Mincer & Ofek 1982, Haggstrom et al 1984, Marini et al 1989). While it is 
difficult to disentangle the effects on earnirgs of parenthood from those of 
marriage—since the majority of married couples have had or anticipate 
having children—there is a large literature suggesting some central mech- 
anisms. First, marriage itself may limit women’s job options and earnings, as 
evidenced by the cost of family migration foz wives both in terms of earnings 
(Sandell & Koenig 1978) and future earnings-based retirement benefits from 
social security and pensions. Second, marriage and parenthood are character- 
ized by gender inequality in the division of tome work. As mothers of minor 
children, women tend to be the primary parent and to do the larger share of 
nonmarket work even if they also work outside the home (Stafford 1980, Cain 
1984, Berk 1985, Coverman & Sheley 1936, Robinson 1988, Hochschild 
1989). Estimates of the division of houseaold labor indicate that women 
spend twice as much time as men per day doing housework and caring for 
children (Berk 1985). Third, when family responsibilities are associated with 
reductions in market work, this usually falls to women. Many women with- 
draw from the labor force in the time surrounding childbirth (Mott & Shapiro 
1978, Cramer 1980, McLaughlin 1982, Most & Shapiro 1983, Haggstrom et 
al 1984). Using data from the NLS-High ScEool Class of 1972, Haggstrom et 
al (1984) find that while most married vomen hold paid jobs prior to 
pregnancy, they begin to leave work at the tme of conception. By the month 
of first birth, only 22% still hold a job. By the time the child is two, 
employment rates reach only 60% of pre-Lirth levels. Quite beyond work- 
related effects, parenthood may influence married women's educational in- 
vestments. Haggstrom et al find that among married women first births are 
associated with reductions in school enrolment. Similar employment and 
educational effects are not apparent amorg married men, and migration 
typically raises men's earnings. 


Direct Sources 


Substantial evidence suggests that economically vulnerable couples are 
selected into widowhood and divorce anc that marriage and parenthood 
contribute to the lower earnings potential of women. In this section 
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we explore the mechanisms by which women's economic vulnerability may 
be compounded further by the divorce and widowhood event. 

First, protected in marriage against their own economic vulnerability by 
husband's earnings (or retirement income), women's loss of automatic shar- 
ing in those earnings is the primary reason for postdissolution hardship. When 
the implicit insurance from husband's earnings is lost, women face the 
“preexisting” obstacles to fully replacing this lost income through their own 
work effort (see e.g. Garfinkel & McLanahan 1986). 

Second, women commonly remain the primary child caretaker when di- 
vorced, and when widowed they must remain so. Few divorced fathers ask for 
or are awarded physical custody (Weitzman 1985, Maccoby et al 1988), 
perhaps reflecting the fact that the vast majority of divorced women were also 
the primary parent during marriage.!? Child custody practices severely con- 
strain the time women have available for market work. The Jack of affordable 
child care increases the constraints children place on greater market work; few 
employers in the United States provide full-time jobs that integrate parental 
and market work responsibilities, by, for example, providing on-site child 
care or allowing leave to care for sick children (Kamerman & Kahn 1988). 
This is exacerbated by the greater likelihood of women working in jobs that 
provide no benefits, either because of part-time employment or occupational 
concentration in (service) jobs and (small) firms, !! 

Third, separated and divorced women with children are usually the main 
financial contributors to their children's well-being. In theory, children of 
divorced parents, unlike children of widows, have two parents to depend upon 
financially (Bane 1976).? In fact, however, a substantial proportion of 
nonresidential fathers make little economic contribution to their children's 
lives, due to both low award levels and high noncompliance with awards. 
Nationally, 61% of separated, divorced, and never-married mothers had a 
child support award in 1985. Only one half of these women received full 


V Child custody decisions may not be independent of support awards and property settlements. 
Mothers may face threats of custody loss if they pursue higher support awards or more equitable 
property division (Weitzman 1985, Arendell 1986). There is, however, no evidence of a 
relationship between legal custody (i.e. whether sole or joint) and child support awards when 
controlling for the higher income of fathers with joint custody (Seltzer 1991). 

Children may remain covered by the absent parent's health policy. The divorced wife may 
purchase a policy during a 36 months continuation period, but is not likely to have the premium 
subsidized by the employer. 

Not all fathers are in a financial position to contribute child support or to insure against their 
deaths. As Malvaux (1985) suggests, high unemployment rates among blacks argue for job 
creation rather than child support enforcement as a means to improve the lives of minority 
children. US. GAO (1988) shows that pension legislation mandating survivor benefits would not 
reduce poverty rates among widows since the husbands of poor widows were unlikely to have 
high paid, pension-covered jobs. 
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payment, one quarter received partial payment, and another quarter no pay- 
ment (US Bureau of the Census 1989). This aigh rate of noncompliance is not 
apparently due to the inability of nonresiden-ial fathers to pay award amounts 
(Chambers 1979). Among those without an eward there is evidence that little, 
if any, other financial aid is contributed to th» residential parent (Roan 1989). 

Even in the event of full compliance wita child support awards, mothers 
and dependent children are likely to experience severe declines in their 
standard of living, whereas the economic well-being of fathers will be rela- 
tively unaffected (Chambers 1979). Support awards rarely exceed 30% of the 
nonresidential parent's net earnings, and that proportion typically declines 
with the income of the nonresidential parent (Weitzman 1985). Though states 
have drafted guidelines for support awards by family income level, judges 
still have considerable discretion in setting award amounts (Weitzman 1985). 
In 1985, the average yearly payment to recipients of any child support was 
just $2500 for divorced women and $2100 fcr separated women (U.S. Bureau 
of the Census 1989). These amounts do not »egin to cover childrearing costs, 
or even the average cost of child care (Seltzer 1989, Weitzman 1985, Aren- 
dell 1986, Weitzman 1988). 

If upon divorce women received large property settlements and if husbands 
left large estates to widows, income-based measures would underestimate the 
actual level of economic well-being of women following marital disruption. It 
is broadly accepted that full measures of economic well-being—both of 
married couples and unmarried men and women—should include the value of 
property and financial wealth. The issue is whether the share of the married 
household’s assets given to the wife following divorce or husband’s death 
compensates for the disproportionate income outcomes for the wife. 

Available evidence does not suggest such Zompensation. Divorcing couples 
generally have little to divide in terms of tangible assets and property, such as 
homes, other real estate, businesses, savings, and marketable securities 
(Weitzman 1985, Seltzer & Garfinkel 19907. In an analysis of approximately 
1800 divorce cases in Wisconsin counties, Seltzer & Garfinkel report that the 
median value of marital property at the time of divorce is just $7800; the mean 
percent of this property awarded to mothers is slightly over 5076. Because of 
the low monetary value of most property settlements, marital property con- 
tributes only slightly to postdivorce economic well-being for either men or 
women (Seltzer & Garfinkel 1990). 

Because of the age distribution of widowhood, the most valuable assets 
held by these couples are their home, pension, and social security. The share 
of a couple's social security benefits paid to -he widow is fixed by law, but the 
share remaining of other assets depends apon the bequest and insurance 
choices of the husband. Prior to widowhocd, couples have little "bequeath- 
able" nonhousing wealth—estimated by Hu-d & Wise (1989) at just $11,005 
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(1979 dollars) among RHS couples not poor prior to widowhood—a large 
share of which does pass to the widow. But the most valuable assets— 
pensions and private annuities—are almost dissipated upon the husband's 
death. Full measures of wealth lead to the same conclusions about declines in 
economic status associated with widowhood (Holden 1989b, Hurd & Wise 
1989). 

In view of this, in divorce cases an expanded definition of "divisible" 
marital property to include spouse's potential earnings and pensions would 
partially compensate women for the loss not just of their husband's current 
income, but of the future security jointly built through the work efforts of the 
earner(s), the indirect contributions of each spouse, and investments in the 
family during marriage (Weitzman 1985, Arendell 1986). Current definitions 
of marital property as tangible assets contribute to women bearing a dis- 
proportionate economic cost of divorce. 

Finally, US social policy alleviates economic hardship following divorce 
and widowhood only for a small group of women (Folbre 1984, Kamerman 
1984, Garfinkel & McLanahan 1986, Burkhauser & Duncan 1989). Unlike 
other industrialized countries, the United States fails to provide universal 
child allowances, subsidized childcare, national health insurance, or guaran- 
teed minimum child support benefits that might otherwise smooth the income 
effects of marital dissolution (Kamerman & Kahn 1978, Kamerman 1984, 
Garfinkel & McLanahan 1986, Kamerman & Kahn 1988). Currently, Aid to 
Families with Dependent Children (AFDC), the sole public transfer program 
that aids separated and divorced women, is restricted to women with de- 
pendent children and whose income is below a state-defined needs standard. 
In combination with food stamps, it provides a small proportion of total 
income for women following separation and divorce; its use is largely re- 
stricted to those receiving little in the way of child support and who were 
lower-income while married (Hoffman 1977, Corcoran 1979, Weiss 1984, 
Duncan & Hoffman 1985, Stirling 1989). On the other hand, widows—if they 
have children in their care or are 60 or older—are eligible, without income 
and asset testing, for survivor benefits from social security. Compared to 
AFDC transfers, these are relatively generous; in 1988 average monthly 
benefits to widows with one child were $921 per month, an amount that 
would alone raise a two-person family to 3596 above the poverty level. 


CONCLUSIONS AND IMPLICATIONS 


Despite variability in measures of economic status, data sources, analytic 
design, and sample selection rules, the longitudinal studies unanimously 
conclude that widowhood and divorce cause a decline in the economic status 
of women and their dependents, especially for particular subgroups. This is 
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not the case for men, where divorce and w dowhood may lead to enhanced 
economic well-being. Quite beyond issues of short-term change or the relative 
deprivation of women compared to their ex-spouses or widowers, the evi- 
dence suggests prolonged economic hardship for women and, if present, their 
children. The finding that remarriage remains the only clear means to recov- 
ery among divorced women merely underscores women's economic vul- 
nerability. 

Research to date has largely concentrated on documenting changes in 
economic well-being in the time surroundinz widowhood and divorce. Still, 
we know a good deal about the conditions that cause greater economic 
insecurity for women than men outside marriage. Central are the average 
lower wages paid to women both during marriage and after, a phenomenon 
that can be “explained” in part by educat onal differences, reduced labor 
market opportunities (owing to labor market discrimination and occupational 
concentration), and less work experience. 

Taken together, it is difficult to escape -he conclusion that much of the 
association between marital dissolution and women's economic insecurity lies 
in the fact that parenthood's costs are disproportionately borne by women 
even long after the children leave home. Tris division of responsibility may 
be functional within marriage if, as some theorists argue, women have a 
comparative advantage in home labor owing either to their biological commit- 
ment to childbearing (Becker 1981) or to role socialization processes and 
labor market discrimination (e.g. England & Farkas 1986). In marriage, 
husbands’ earnings and pensions "insure" their wives against lower earnings 
potential. Once outside the marriage the eccnomic shortfall exacts a high toll 
on the life chances of children and single mothers (e.g. McLanahan 1985, 
Garfinkel & McLanahan 1986, McLanahai & Bumpass 1988). Continued 
sharing after marriage must come in the form of explicit transfers from 
husband to wife, either through the division of assets, child support payments, 
or insurance against death. The absence of such transfers is compounded by 
women's disproportionate financial and timz responsibility for children after 
divorce (and sole responsibility after widowhood), the incompatibility of 
primary parenting and full-time work, and the lack of affordable child care. It 
is not surprising that approximately one in two female-headed families with 
children lives in poverty and that over one quarter of elderly women alone are 
poor. 

A strong case can be made that children should continue to share the 
standard of living of the higher-income pareat, who is usuallv the nonresiden- 
tial father. In other cases, where poverty and unemployment affect both 
parents—and meager combined resources now must support two house- 
holds—children and their caretakers will recuire a minimum assured income. 
Particular emphasis in policy dialogues is thus placed on ensuring adequate 
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child support, whether from the noncustodial parent or through gov- 
ernmentally guaranteed minimum benefit levels (e.g. Garfinkel & McLana- 
han 1986, Kamerman 1988, McLanahan & Booth 1989). 

For households without children, a case can be made that security of 
income is the key goal of income support programs (Lampman 1984), and 
that when husbands die or leave women alone the latter should continue to 
share in the economic prosperity of the ex- or deceased spouse. Thus, 
insurance against potential earnings loss might be required that would be 
effective at all ages, not just when survivors are elderly. 

Prospects for the future are uncertain. Divorce rates are expected to remain 
high, but it is conceivable that the economic consequences will be modified. 
The literature to date has focused on divorces occurring in the late 1960s 
through the mid-1970s. The costs of divorce and widowhood may become 
more equally distributed between men and women as greater work experience 
and smaller families enable women to accumulate a larger share of the joint 
capital of the family (Sgrensen 1991b). At the same time, their employment 
may actually deter their being awarded a greater share in the higher income 
assets of their husbands and men with working wives may be even less likely 
to forego consumption now in order to insure their wives against future 
widowhood. Consequently, unless changes in female work roles are mirrored 
by social policy initiatives on several fronts as well as men's assumption of 
equal responsibility for children (both within and out of marriage), economic 


prospects for many women and children will continue to be grim. 
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Abstract 


Rather quietly over the last decade, a large body of literature has emerged to 
consider how new forms of organization arise and become established in the 
organizational community. The literature represents a very wide array of 
theoretical perspectives, and no emerging consensus or dominant theme can 
plausibly be identified. No long stream of research has been produced to 
validate the arguments of any perspective. What we find instead is a disparate 
group of mostly nascent theories from organizational ecology, economics, 
institutional sociology, strategic management, and others, all seeking to 
explicate the nature of contexts and processes that may generate new organi- 
zational forms. This review organizes this literature according to assumptions 
about how variations are generated in the organizational community. Three 
perspectives appear to capture most of the arguments: an organizational 
genetics view, which emphasizes random variation; an environmental con- 
ditioning view, which considers variation to be contextually constrained; and 
an emergent social systems view, which considers variations in organizational 
forms to be the products of embedded social-organizational interactions. 
Theories associated with each of the perspectives are explicated, and their 
practical implications for future research are examined. The review concludes 
with a brief consideration of the theory of the evolution of new organizational 
forms as itself an evolution of a new and important field of study. 
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80 ROMANELLI 
INTRODUCTION 


Over the past decade, a chorus of critics aud commentators (e.g. Aldrich & 
Mueller 1982, Astley 1985, Fombrun 1988 Hawley 1988, Romanelli 19892, 
Meyer 1990) have emphasized the origins of new organizational forms as one 
of the critical unaddressed issues in organizational sociology. Interest in the 
question stems back at least to Stinchcombe (19652), who proposed that the 
array of organizational forms existing at ny point in time is a product of 
innovative organizational responses to environmental conditions earlier in 
history. Organizational ecologists (Hannan & Freeman 1977, 1989, Aldrich 
1979) underscored the importance of origins when they emphasized variation 
as a key theoretical construct for explainirg the evolution of organizational 
populations. Others (e.g. DiMaggio 1983, [988, Nelson & Winter 1982, Van 
de Ven & Garud 1989) have described important policy implications of new 
organizational forms for both government agencies and corporate managers. 
Hannan & Freeman (1989:3) offered perhaps the strongest rationale for 
investigating the evolution of new organizational forms when they stated: “the 
ability of a society as a whole to respond to changing conditions depends on 
the responsiveness of its constituent organizations and on the diversity of its 
organizational populations." Organizational diversity sets the limit on the 
range of alternative solutions that are available in the environment. In the long 
run, in a changing environment, diversity cen only be maintained or increased 
by the introduction of new organizational forms. 

Even more than theoretical arguments, however, recent empirical work on 
the competitive and institutional dynami-s of organizational populations 
identifies a need for investigation of the origins of organizational forms. 
Studies of age dependence in organizatioral mortality (e.g. Carroll & De- 
lacroix 1982, Freeman et al 1983) demorstrate that younger organizations 
face a greater likelihood of failure than older firms. Presumably, new organi- 
zational forms are similarly vulnerable to wnat Stinchcombe (19652) termed a 
liability of newness. Studies of populations that compete in overlapping 
resource spaces (e.g. Hannan & Freemar 1987, Barnett & Carroll 1987, 
1990, McPherson & Smith-Lovin, 1988, Barnett 1990) also reveal rather 
strong limits on the number of organizatioral forms that can coexist. Finally, 
from a different line of argument, studies from institutional sociology (e.g. 
Rowan 1982, Tolbert & Zucker 1983, Fligstein 1985) show how in- 
stitutionalizing processes in organizational fields (DiMaggio & Powell 1983) 
engender widespread structural conformity across organizations that offer 
similar products and services. In other words, while organizational diversity 
may be crucial to sustained economic and social well being, strong forces of 
competition and institutionalization work to reduce the diversity of organiza- 
tional forms. 
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Despite the evident need for theory and research on the evolution of new 
organizational forms, organization theorists have only recently begun con- 
sidering how new forms might arise and become established. No theoretical 
consensus exists regarding an approach to the problem. In fact, in the absence 
of much empirical work to support the claims of any approach, the conceptual 
perspectives are diverging. Rather than indicating an occasion for synthesis or 
integration, however, this theoretical diversity, like organizational diversity, 
may provide a fruitful ground for conceptual and empirical innovation. 

The purpose of this review is to draw together and organize this diverse 
body of literature on the evolution of new organizational forms. I begin with a 
brief examination of the concept of organizational form and its implications 
for the study of organizational form evolution. The body of the review 
organizes perspectives on evolution according to their understandings about 
the contexts and processes that tend to generate organizational form var- 
iations. Three views are examined: (a) An organizational genetics view (e.g. 
McKelvey 1982, McKelvey & Aldrich 1983, Nelson & Winter 1982) focuses 
attention on characteristic traits of organizations; this view considers varia- 
tions to be random events arising from the ongoing exchanges of everyday 
activity. (b) An environmental conditioning view (e.g. Stinchcombe 1965a, 
b, Brittain & Freeman 1980, 1986, Hannan & Freeman 1989, Romanelli 
19892, Aldrich & Waldinger 1990) emphasizes that environments are more or 
less conducive to organizational form variations and constraining of the types 
of variations that can be invented. (c) Finally, an emergent social systems ` 
view (e.g. Van de Ven & Garud 1989, Rappa 1989) considers organizational 
form variations to be the products of embedded social-organizational in- 
teractions. Research implications of each perspective are discussed along the 
way. I conclude with an argument that these many different emerging theories 
on the evolution of new organization forms are themselves part and parcel of a 
new organizational theory. 


THE CONCEPT OF ORGANIZATIONAL FORM 


To speak of the evolution of new organizational forms implies that the 
concept of organizational form is relatively unambiguous. If we want to know 
how something comes into being, it seems reasonable that we should be able 
to identify the thing when it appears. Certainly, understandings of the concept 
should span various theories of organizational form evolution. While none of 
these objectives is easily obtained in the current state of organization theory, I 
argue that the mere imputation of a concept of form serves to organize and 
advance investigation of changes in organizational diversity. 

Most broadly, the concept of organizational form refers to those character- 
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istics of an organization that identify it as a distinct entity and, at the same 
time, classify it as a member of a group of similar organizations. For 
example, in a community, we can distinguish the pizza place from the gas 
station down the street based on differences in the organizations’ products, 
services, equipment, physical layout, and so on. We can further differentiate 
among pizza places themselves based on varieties of pizza offered, whether 
they deliver, and whether they offer food other than pizza. We can also 
consider pizza places simply as restaurants. which are distinctively different 
from mining companies. All of these wzys of categorizing pizza places 
represent legitimate distinctions among organizational forms. 

Despite the ease with which we may identify meaningful groupings of 
organizations, no commonly accepted classification scheme has been de- 
veloped. Theorists debate the usefulness of a general scheme. At one end of 
the argument, McKelvey (1982) emphasized the need for a taxonomy of 
organizational forms, i.e. for theory and methods that can classify organiza- 
tions according to their differences and similarities. The clear objective of 
taxonomic research is development of z relatively stable classification 
scheme. Such a scheme would improve our confidence regarding the 
generalizability of research findings. To the extent that the scheme repre- 
sented the complete set of existing organizational forms, new forms could be 
easily identified. Unfortunately, no organizational taxonomy has been de- 
veloped. 

At the other end of the debate, Hannan & Freeman (1977, 1989) explicitly 
refrained from proposing any fixed rules or -ypology for identifying organiza- 
tional forms. They argued that form max be generally inferred from an 
organization's formal structure or normative order, and that the classification 
of an organization as one form or another may be specified according to the 
interests of the investigator. For example. Freeman & Hannan (1983) de- 
scribed how 13 distinct forms of restaurants in California could be collapsed 
to two categories of specialism and generalism (Hannan & Freeman 1977) 
because that was most useful to the pursoses of their study. Brittain & 
Freeman (1980) and Romanelli (1989b) also distinguished organizational 
forms on the basis of organizational strateg-es—e.g. specialists versus gener- 
alists, r- versus K-strategists. Other studies have usefully classified organiza- 
tions on the basis of differences between product/service classes; e.g. Carroll 
& Hannan’s (1989) study of density delay in mortality rates for five pop- 
ulations—American labor unions, Argentizean newspapers, Irish newspap- 
ers, newspaper publishers in the San Francisco region, and American brew- 
ers. Finally, some researchers have defined organizational forms on very 
broad and inclusive dimensions. Fligstezn's (1985) study of the multi- 
divisional organizational form and a series of studies by Singh and his 
colleagues (e.g. Singh et al 1986, Tucker et al 1990) on voluntary service 
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organizations in Toronto are good examples of this approach. The principal 
benefit of this more flexible approach to classifying organizations lies in its 
recognition that organizations do not fall neatly into a few, perhaps hierarchi- 
cally ordered, categories. Research can proceed on the basis of intuitive, 
descriptive distinctions. The main drawback is that it is not easy to know how 
findings may generalize to other forms of organization. As an increasing 
number of studies employ this approach; however, the basis for empirically 
identifying dimensions of generalizability will expand. 

Between these two extremes lies a large number of proposals for classifying 
organizations that are neither fixed in the theory of a taxonomy nor as 
generally flexible as the approach advocated by Hannan & Freeman. Some 
investigators have emphasized dimensions of variation in organizational form 
without specifying particular types. For example, Aldrich & Mueller (1982) 
described organizational form variations over the general dimensions of 
technology (the set of processes for accomplishing tasks), coordination (the 
organization of activities toward a common goal), and control (the mainte- 
nance of an organizational boundary). Using these dimensions, they were able 
to trace the origins of several major organizational forms (e.g. factory produc- 
tion, monopolistic control) over the course of industrial history in the United 
States. Tushman & Romanelli (1985) described organizational culture, strat- 
egy, structure, power distributions, and control systems as useful dimensions 
for examining organizational form evolution within product-market classes. 
In both cases, the authors devised these dimensions as a basis for identifying 
changes in organizational forms over time. A basic problem in any evolution- 
ary theory is the specification of significant differences in organizational 
form. The use of common dimensions for comparing organizations over time 
helps identify both the nature and degree of emergent differences. 

Finally, of course, a very large number of organizational typologies have 
been developed. A typology is simply a description of the characteristics of 
distinct organizational groups whose primary differences are identified 
according to the interests or beliefs of the theorist. Rothschild-Whitt (1979) 
offers an excellent example of typological method as she distinguishes the 
collectivist form of organization from the rational-bureaucratic form on the 
basis of characteristic differences over clearly specified dimensions of organi- 
zational activity and structure. Scott (1981) identified 18 organization 
typologies, which he classified according to the “organizational element"— 
goals, social structure, technology, participants, and environment—each 
typology emphasized. His classification scheme is perhaps the closest thing 
we have in the way of an organizational taxonomy. Perhaps an emergent 
theory of organizational classification may be extrapolated from the fit of 
these many typologies within a relatively few dimensions or elements of 
organization. 
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The principal benefit of typologies as a besis for identifying organizational 
forms is that they usually provide a detailed description of different types so 
that organizations may be unambiguousky classified. The drawback, of 
course, is that such specificity may obscure similarities or differences on other 
dimensions or elements of organization. 

What can be learned from this diverse erray of approaches to identifying 
organizational forms? Can the evolution of new organizational forms be 
considered without a consensus on the meaning and measurement of differ- 
ences in organizational form? I propose that the discussion alone, no matter 
how disparate the viewpoints, properly focuses attention on the importance of 
organizational differences and similarities. Two observations merit elabora- 
tion. First, whether or not we ever achieve consensus regarding the meaning 
and measurement of organizational form, the concept has spurred researchers 
to attend more thoroughly to the similarities and differences of organizations 
in their samples. This alone improves prosoects for generalizability. A "ran- 
dom sample of organizations" is no longer an acceptable research procedure. 
We must now know and describe the basiz on which firms are grouped and 
compared. Second, whether or not we car ever agree on a classification of 
organizational forms, the concept emphasizes the importance of establishing a 
baseline for comparing new patterns in organizational activity. Evolution can 
only be known after the fact, when differences in form can be clearly 
identified. 

The perspectives that follow do not o»viously adopt any one of these 
concepts as the basis for their consideratior of organizational form evolution. 
In every case, however, and albeit in various ways, all of the authors indicate 
the existence of relatively stable patterns or forms of organization as the basis 
for investigating the contexts and processes of evolution. Though particular 
organizations may be classified differently according to the interests of an 
investigator, evolution is always examined against the backdrop of some 
system for classifying organizations. The concept of organizational form 
helps clarify the point that evolution is the study of changes in types, not 
changes in individual entities. The process by which evolutionary differences 
emerge becomes open to discussion. 


ORGANIZATIONAL GENETICS: A RANDOM 
VARIATION VIEW 


Jt is appropriate to begin a review of perspectives on the evolution of 
organizational forms with theories that draw directly from biological evolu- 
tionary models. Two works, Organizatiozal Systematics (McKelvey 1982) 
and An Evolutionary Theory of Economic Change (Nelson & Winter 1982), 
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stand out as offering rather complete, if also somewhat different, views of a 
biological analogue of variation in organizational forms. 

The starting point for both of these theories is an elaboration of the concept 
of an organizational gene. Two characteristics of "genes" make them critical 
to construction of an evolutionary theory. First, genes must express the 
characteristic traits that identify the distinctive nature of the organization and, 
at the same time, make clear its appropriate classification based on similarities 
with other firms. Second, these organizational genes must be transmissible or 
communicable so that the characteristic traits can be copied. Nelson & Winter 
describe "routines" as the organizational counterpart of biological genes. 
Routines, which refer to formal as well as tacitly understood rules of be- 
havior, are defined as the regular and predictable behavioral patterns of firms. 
Routines are reflective of historically given decisions and behaviors that have 
come to govern the actions of a firm (i.e. profit maximization as a realized 
goal need not be assumed), and they tend to persist in organizations for rather 
long periods. Somewhat similarly, McKelvey described organizational “com- 
petence elements" (comps) as the analogue of biological genes. Comps are 
base units of knowledge and skill that make up what the organization knows 
how to do. Comps also tend to persist in organizations for rather long times. 

Routines and comps express the characteristics of organizational form that 
are selected for or against by environmental conditions. Natural selection is 
the second key theoretical component of these evolutionary models. Nelson & 
Winter and McKelvey disagree about how variations in routines or com- 
petencies are generated. . 

As the basis of their theory of organizational evolution, Nelson & Winter 
distinguished among three classes of routines: (a) routines, called operating 
characteristics, that govern organizational decisions and behaviors given a 
particular stock of resources; (b) routines that augment or diminish the stock 
of resources in response to changes in the state of the organization (e.g. 
growth, increased/decreased profitability) or the environment; and (c) search 
routines, including the organization's own R&D and its investigation of what 
other firms are doing, that can modify various aspects of the operating 
characteristics. The particular search routines of an organization constrain, in 
a probabilistic sense, what is likely to be found. The organization will alter its 
operating characteristics or not according to the firm's criteria for evaluating 
new alternatives. These variations will then be subject to processes of natural 
selection. At any given point, organizations will present for selection a broad 
array of routines, some long-established and some recently innovated. Those 
organizations whose routines are relatively better fit for coping with environ- 
mental conditions will thrive; other organizations will either imitate the more 
successful routines (i.e. their search routines will discover them, and the 
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firms' evaluation criteria will permit adoption of the new routines), or they 
will decline. 

McKelvey, on the other hand, regardec the organizational population, not 
the individual firm, as the critical focus o7 evolutionary analysis. An organi- 
zational population is defined simply as a group of firms with similar forms, 
or dominant competencies. There are twc important results of this emphasis 
on the population. First, adoption of the pcpulation as the unit of environmen- 
tal selection means that a variation in orgaaizational form must establish itself 
in at least a few organizations before ary selection can occur. McKelvey 
emphasized isolating processes that can provide variations with a hospitable 
resource environment, meaning principzlly that the variation is not be- 
leaguered by competition from existing organizational forms. The second 
important implication of the population focus is that the individual firm in an 
established population is not examined as the place in which variations occur 
or the means by which they are generatec. McKelvey argues that comps are 
carried (at least temporarily) in the minds of individual employees. Because 
people are mobile, and because people can communicate what is on their 
minds, interactions between individuals bzcome the vehicle for the transmis- 
sion of comps. An ever-churning exchange of competencies can occasionally 
result in a new combination of competenze elements, i.e. a variation in the 
organizational form. 

Despite differences in their theories about how variations are generated, 
both Nelson & Winter and McKelvey posited that the nature of realized 
variations in organizational forms (i.e. variations that are selected by the 
environment for retention) is random. Neither theory proposed a guiding hand 
of any sort. Changes in environmental conditions may increase or decrease 
the rates of variation occurring in the crganizational community. For ex- 
ample, from the Nelson & Winter perspective, a technological invention 
might trigger a wave of changes in organizational operating routines set off by 
organizational search routines. From the McKelvey perspective, we might 
expect the establishment of a new organizational population that is isolated 
from competitive forces due simply to a few individuals having knowledge of 
the new technology. Changes in environment will not, however, dictate the 
occurrence of particular variations. Variations in organizational activities may 
be conditioned by the nature of the routines or comps that preceded the 
variations, but there is no way of knowing which of the variations will prove 
better adapted to the environment. Geneti- theories of evolution in organiza- 
tional forms expressly limit the focus of ccncern to the processes of evolution, 
not its specific outcomes. 

There are at least two important prac-ical implications of these process 
theories of random variation. First, biological analogies for the evolution of 
organizational forms are conceptually attzactive because they offer so com- 
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plete a view of the processes of evolution. If we could identify these 
characteristic organizational routines or competence elements, we could more 
accurately and unambiguously classify firms according to their similarities 
and differences. With respect to variation, we would be able to tell more 
clearly when one had occurred. Since routines and competencies are also the 
characteristics of organizations that change when an organization is involved 
in the evolution of an -organizational form, we might be able to track the 
variation process itself. 

A major drawback to these theories, of course, is the empirical elusiveness 
of routines and competencies. Only limited attempts, even by these authors 
themselves, have been made to identify existing routines or competencies in 
organizations. Winter (1990:276) discusses the benefits of "careful descrip- 
tion of prevailing routines," but does not mention any such efforts specifical- 
ly. McKelvey presented a history of Ancient Mesopotamia to show the 
evolution of different organizational forms, and he especially emphasized the 
isolating events (e.g. floods, wars, and the physical movement of pop- 
ulations) that could both nourish and protect the organizational form varia- 
tions. Presumably, the variations derived from some combining and 
recombining of existing competence elements. Retrospective examinations of 
such time-distant events, however, cannot identify the competence elements 
or any new combinations that account for new organizational forms. 

Hannan & Freeman (1989) questioned whether even an exhaustive delinea- 
tion of organizational competence elements would be able to distinguish 
between different organizational forms. They posed the example of public 
versus private universities and wondered whether differences between the two 
could be discerned even with a complete list of the competencies of the 
organization's members. Presumably, members' competencies would overlap 
almost completely. Of course proponents of the genetics analogy might 
respond that such evidence would imply that the public-private distinction for 
universities is not indicative of distinctive organizational forms. Nevertheless, 
Hannan & Freeman's question does raise an issue about how to determine 
which of the many competencies carried by organizational members are 
actually reflective of the dominant competence of the organization. It seems 
that much thick description of intuitively different organizations will be 
necessary before rigorous taxonomic research can be conducted. 

The second practical implication of these theories expressly does not rely 
on any empirical demonstration of routines or competencies. If variations are 
random, then we can examine patterns of organizational form evolution 
without needing to identify the specific processes that generate the variations. 
For example, we may be able to identify contextual conditions in organiza- 
tions and in environments that are more and less facilitative of variations 
being generated (Aldrich 1979). Under some conditions, knowledge about the 
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patterns of evolutionary differentiation mzy lead to supportable assumptions 
about the nature of processes that produce the variations themselves 
(Romanelli & Tushman 1986). For exampie, some organizations’ search and 
evaluation routines may be more producti~e of variations than those of other 
organizations. Such organizations might be identified through investigation of 
patterns in variation. Overall, emphasis on patterns in variation supports 
investigation of rates and locations of variations. The specific nature or 
content of the variations, which is a relevant concern of some of the other 
perspectives, can here be ignored. 


ENVIRONMENTAL CONDITIONING: A CONSTRAINED 
VARIATION VIEW 


Where the genetics analogy emphasizes microlevel processes that produce 
specific organizational variations, other perspectives from economics (e.g. 
Schumpeter 1939, 1950, Dosi 1988) and sociology (e.g. Stinchcombe 1965a, 
Hannan & Freeman 1989) examine how conditions of environments may be 
more or less productive of organizational form variations, and predictably so. 
Three related approaches to the study of constrained variation in organization- 
al forms capture most of the relevant research and theory: (a) Creative 
destruction, described by Schumpeter (1939, 1950) as a competitive process 
that advances the evolution of new orgarazational forms in response to the 
development of new technologies, emphases industrial organization as a key 
influence on the likelihood of evolution. <b) Environmental imprinting, de- 
scribed by Selznick (1949, 1957) and Stimchcombe (1965a) as a process by 
which new organizations forms come tc reflect environmental conditions 
during the period of the form's evolution, emphasizes the historical condition- 
ing of organizational forms. (c) Finally, organization speciation, the most 
recent argument in the constrained variction view (Freeman 1982, 1986, 
Romanelli 19893, Lumsden & Singh 1€90), examines organizations and 
populations as producers of organizational form variations. 


Creative Destruction 


Probably the intellectual father of all research on macro level change in 
organizational environments is Joseph Sch mpeter. Especially in his writings 
on capitalism and on business cycles (1€39, 1950), Schumpeter advanced 
propositions linking changes in the structures of industries (e.g. concentra- 
tion) to rates of new industry formation. H2 considered technological innova- 
tion to be the primary factor in the creetion of possibilities for structural 
change. New technologies, which existiag organizations typically do not 
adopt and assimilate easily, offer the greztest possibilities for new resource 
combinations, i.e. for new forms of orgaaization. As the new technologies 
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support competitive activity that challenges existing organizations, the older 
firms begin to fail and in so doing free up even more resources for more new 
combinations, and so on. In his famous phrase, “the gales of creative destruc- 
tion," Schumpeter expressed the dynamic linkages among the demise of 
existing organizational arrangements, the invention of new technologies, and 
the rise of new organizational forms. 

Schumpeter's ideas have spawned literally hundreds (perhaps thousands) of 
empirical studies in the economics literature, although most of them have 
focused on factors that affect R&D expenditures in existing organizations and 
not on rates of technological innovation per se or on rates of organizational 
birth. Expenditures on R&D are presumed to relate to rates of technological 
innovation, but only a few studies have examined this relationship directly. 
Due to this literature's emphasis on the activities of large, ongoing organiza- 
tions, and its focus on technological innovation and not the invention of new 
organizational forms as the outcome of interest, I will not examine this 
literature further here. Kamien & Schwartz (1982) and Dosi (1988) provide 
superb reviews. 

A few themes in organization theory, however, do strongly reflect 
Schumpeter's ideas about competition among populations. Recent studies 
from population ecology compare the relative advantages of competing pop- 
ulations based on their abilities to exploit evolving resource conditions. For 
example, a series of studies by Barnett and Carroll (Barnett & Carroll 1987, 
1990, Barnett 1990) on competition and cooperation among forms of tele- 
phone companies during the early years of that industry clearly indicate that 
interactions and relationships of organizational forms affect the ongoing 
diversity of forms. Similarly, a study of competition for human resources 
among voluntary associations in five nations by McPherson & Smith-Lovin 
(1988) showed how differences in organizational forms affect the stability of 
the niche structure in an organizational community. In their study of the 
ecology of organizational foundings in American labor unions, Hannan & 
Freeman (1987) showed how the growth of industrial unions restrained the 
founding rate of craft unions. These studies focus on the interdependent 
relationships of existing populations. Thus, their findings pertain principally 
to factors that either maintain or (in most cases) reduce the diversity of 
organizational forms in an environment. This emphasis is in keeping with 
ecologists’ prevalent concern for specifying selection mechanisms. These 
same competitive processes might also be examined, however, for their effect 
on the larger resource space and, thus, on the evolution of new organizational 
forms. 

Tushman & Anderson (1986) took up this theme explicitly in their study of 
the differential likelihoods that old or new populations of firms will adopt and 
promote technological innovations. Particular innovations, they argued, will 
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either enhance or destroy the established sompetencies of existing organiza- 
tions. Based on a study of technological innovations in three industries— 
minicomputers, airlines, and cement—tkeir work showed that when new 
technologies enhance the competencies of existing organizations the existing 
organizations will adopt the technologies. When the new technologies are 
destructive of existing competencies, hosvever, new organizations will be 
established with competencies explicitly tailored to the new technological 
conditions. Though Tushman & Anderson did not directly discuss the com- 
petitive demise of old populations due to activities by new populations, their 
findings are directly in keeping with Schumpeter’s theory of competition in 
the evolutionary process. 


Environmental Imprinting 


A very different perspective on the evolition of new organizational forms 
develops from the arguments of Selznick. (1957) and Stinchcombe (19652), 
both of whom argued that organizations tend to take on the characteristics of 
people and environments that surround their early establishment. Selznick 
emphasized the influence of organizationzl founders on characteristics of the 
early organization, although he recognized that the decisions of the founders 
are constrained by environmental condztions. Stinchcombe, by contrast, 
emphasized the influence of environmental conditions, although he stressed 
an important role for individual founders im the process of organizational form 
variation. Together, these arguments have come to be known as the organiza- 
tional imprinting hypothesis. Stinchcombe.(1965a:153) stated the hypothesis 
most clearly, when he wrote: “The organizational inventions that can be made 
at a particular time in history depend upon the social technology available 
at the time. . . . Then, both because they can function effectively with 
those organizational forms, and because the forms tend to become institu- 
tionalized, the basic structure of the orgenization tends to remain relatively 
stable." 

Most of the research on organizational amprinting is found in case histories 
of the establishment and early years oz innovative organizations. Many 
notable analyses of the origins of innovacive organizations suggest a strong 
association between environmental conditions and the creation of a new 
organizational form. For example, Selznz:k (1949) described how changing 
political philosophies in favor of more localized government led to the 
founding of the Tennessee Valley Authorty and shaped administrative char- 
acteristics of that organization long into the future. Kimberly (1979) described 
how an expected shortage of doctors ir the United States, coupled with 
increasing unhappiness about trends toward specialized medicine, led to the 
formation of an innovative new medical school expressly established to train 
general practitioners. These histories desc-ibe specific occurrences of innova- 
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tion in response to specific environmental conditions, and emphasize the 
persistence of the innovative organizational arrangements over a very long 
time. Taking a slightly different perspective, recent excellent accounts of 
crises and declines faced by older organizations—e.g., Sears (Katz 1987) and 
American automobile manufacturers (Halberstam 1986)—trace modern diffi- 
culties to the organizations' ideological commitments and arrangements that 
were established in response to early environmental conditions. All of these 
histories—and there are many more like them (e.g. Hall 1976, Messinger 
1955)—4escribe the origins of highly innovative organizations, i.e. firms that 
were founded to produce a new product or service, to employ a new technolo- 
£y, or to experiment with fundamentally new organizational arrangements. 

While the number of case studies showing imprinting at the level of 
particular innovative organizations is impressive, and supportive of imprint- 
ing as an important factor in the evolution of new organizational forms, large 
sample studies are needed to demonstrate a general effect. Stinchcombe's 
(1965a) demonstration study which showed that modern industries tend to 
reflect the forms of organization innovated during the period of the industries’ 
establishments represents one of the very few attempts. Stinchcombe col- 
lected data on labor force characteristics (e.g. per cent unpaid family mem- 
bers, clerical as per cent of administrative workers) in a number of different 
industries, and he classified the industries according to the general historical 
period of their emergence. What appeared was a relatively close association 
between modern labor force characteristics and the time of the industry's 
emergence. Results of this demonstration must be viewed with caution since 
we cannot determine whether the modern day forms are the same as those 
exhibited earlier in the lifetimes of the industries. Nevertheless, the sub- 
stantial similarity of organizational forms across industries that are similar 
only in their period of emergence suggests a rather strong influence of 
initiating conditions. Kimberly (1975) presents probably the best analysis of 
the imprinting hypothesis as it relates to the evolution of new organizational 
forms in his study of 123 sheltered workshops in the United States. To begin 
with, Kimberly explicitly tied the characteristics of the different forms—i.e. 
sheltered workshops that provided long-term work for disabled veterans 
versus rehabilitation organizations that taught skills to allow the disabled 
person to function independently in the labor market—to changes in prevail- 
ing attitudes regarding the appropriate treatment of the disabled. Moreover, 
Kimberly identified the forms that were exhibited by these firms at their 
founding and demonstrated a rather tight association in time between changes 
in environmental conditions and the organizational form innovation. Perhaps 
most important, Kimberly showed that sheltered workshops did not tend to 
alter their forms even as the rehabilitation organizations began to pervade the 
environment and attract the lion's share of public resources. 
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Difficulties involved in collecting detailed longitudinal data about environ- 
mental and organizational characteristics (especially for new organizations) is 
one possible reason that there are so few scudies of imprinting at the popula- 
tion level. Another reason, however, may Je the scarcity of theory regarding 
the characteristics of environments that may most likely Pügenaes a new 
organizational form. 

One theme suggested regularly is political upheaval (es: Stinchcombe, 
1965a, Aldrich 1979, Carroll & Huo 1986, Carroll, Delacroix & Goodstein 
1990). Stinchcombe (1965a) discussed state revolution as one clear means by 
which established control over the distribition of resources can be broken. 
Emphasis on revolution is reflective of how much stability is believed to exist 
in organizational arrangements, including allocations of resources among 
established power elites. Political revolution frees resources for use in new 
combinations. Carroll et al (1990) develooed a set of propositions that link 
political upheaval to explicit forms of orgamization that will likely emerge and 
thrive; for example, they proposed that r-strategist organizational forms will 
outcompete K-strategists during periods o7 upheaval. Organizations exhibit- 
ing an r-strategy are those that are most able to exploit resources quickly when 
they first come available (Brittain & Freeman 1980); K-strategists are efficient 
utilizers of resources as the environmenta! space reaches carrying capacity. 
Arguing that political upheaval fundamentally represents the replacement of 
one set of K-strategists for another, Carroll et al also proposed that the 
character of the K-strategists before and after the revolution will reflect their 
respective political institutions. The careful system of propositions developed 
in this paper represents an important advarce in the imprinting literature as it 
begins to specify particular relationships >etween environmental conditions 
and the rates and kinds of organizational irnovations that are likely to occur. 

Fligstein's (1990) analysis of the effect of the Cellar-Kefauver act on 
merger activity in the United States shed a somewhat different light on the 
processes and effects of political activity on kinds of organizational forms. 
The legislation itself resulted from political activism against the increasing 
size and power of large, horizontally and vertically integrated corporations. 
The act effectively prevented mergers of closely related organizations that 
could substantially increase a corporatior's control over economic activity 
within industries. It had the unintended corsequence, however, cf stimulating 
mergers among corporations in unrelated mdustries. Large, diversified orga- 
nizations, controlled mainly by the tools of financial as opposed to sales 
analysis, were established as a new organ:zational form. Fligstein's study is 
more reflective of institutional aspects of political activity than competitive 
aspects, which are a key focus of the ecological studies. Mezias (1990) 
provided another example of political and institutional effects on organiza- 
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tional form in his study of influences on regulators of accounting reporting 
standards and firms’ adoptions of different methods. 

Short of revolution or legislation at the state level, Aldrich (1979) has 
described how technological innovation, entrepreneurs’ access to power and 
wealth, and the changing structure of labor markets, can disrupt established 
resource distributions to increase the probability of organizational form varia- 
tion. Brittain & Freeman (1980) and Boeker (1988, 1989) offered evidence of 
imprinting based changes in the industry environment. Brittain & Freeman’s 
analysis of changes in organizational forms over several years in the semicon- 
ductor industry shows r-strategists to be most effective early when resources 
are abundant; later K-strategists displace the r-strategists as the carrying 
capacity of the resource space is approached. This analysis suggests that 
political upheaval and technological innovation may be similar phenomena in 
terms of their main effects on types of organization forms that will be 
established. Boeker links more specific characteristics of organizations’ early 
strategies, such as emphasis on marketing or manufacturing, to particular 
developments in the semiconductor environment. Finally, taking a very dif- 
ferent approach to a slightly different phenomenon—ethnicity and entrep- 
reneurship—Aldrich & Waldinger (1990) analyzed historically conditioned 
opportunity structures and group characteristics as key influences on the rise 
of ethnic entrepreneurial enclaves. Opportunity structures influence the access 
that entrepreneurs can gain to traditional markets. Group characteristics tend 
to predispose certain ethnic groups to establish entrepreneurial networks. 


Organizational Speciation 


Recently, several authors (e.g. Astley 1985, Brittain & Freeman 1986, 
Romanelli 1989a, Hannan & Freeman 1986, 1989, Lumsden & Singh 1990) 
have discussed the evolution of new organizational forms in terms of con- 
ditions and processes of organizational speciation. This perspective empha- 
sizes organizations and the populations in which they compete as “producers” 
of new organizations and new organizational forms. Although there are 
differences in the specific approaches taken by the different researchers, two 
components seem central to all perspectives. First, organizations are consid- 
ered to differ in their capacities for producing new organizations based on 
differential access to information about opportunities for new organizational 
forms. Second, the distribution of information and other resources that is 
necessary to the evolution of new organization forms is expected to change 
over time. Thus, the organizational origins of new forms will change over 
time. 

Several authors have described specific characteristics of organizations that 
should affect how frequently they produce new organizations. Stinchcombe 
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(1965b) proposed that an “organization-creating organization” would have a 
relatively large fund of resources (e.g. moaey, legitimacy) overall, that some 
of these resources would be explicitly free from vested interests, and that the 
firm would operate in a variable environment (e.g. multiple product markets, 
volatile sales growth). In different ways, these variables affect organizational 
members’ abilities to perceive opportunit es for innovation as well as their 
abilities to exploit them. Organizations that are high in these characteristics, 
he argued, will tend to produce more organizational variations. Brittain & 
Freeman (1986) considered five characte-istics of organizations that might 
affect the rates at which they produce nev organizations. Three characteris- 
tics—-CEO succession where an outsider essumes the office, takeover of the 
firm by an organization’ outside of the firm’s primary competence, and 
changes in organizational growth that hinder individuals’ mcbility—-were 
proposed as a positive influence on new-firm production in that they decrease 
the attractiveness of career prospects for tne individual within the firm. The 
other two characteristics—whether the firm is a specialist or generalist, and 
whether it was a first entrant in at leas one of its product groupings— 
influence the amount and variety of in"ormation that is available about 
environmental conditions and entrepreneurial opportunities, and so the rate of 
new firm production. In a study of the ckaracteristics of the entrepreneurial 
source organizations for 351 semiconductor producers that were founded in 
Silicon Valley between 1955 and 1981, Brittain & Freeman found general 
support for these hypotheses. 

Extending these arguments, Freeman (1682, 1986) suggested that organiza- 
tional characteristics affect not only the rates at which existing firms would 
produce new firms, but also the form of the new organizations. He argued that 
“People who start Catholic convents do not do so after toying with the notion 
of beginning cement plants, advertising agencies, or universities" (1982:16). 
Individuals are restricted in the information they possess by their organiza- 
tional experiences. Thus, we might expect them to found organizations in 
populations that are either the same as or closely related to those they have 
worked in previously. 

The concept of an organization commun ty was discussed by Carroll (1984) 
and Astley (1985) as crucial to developing theory about how new forms of 
organization arise. The organizational community, which is defined as a set of 
interrelated organizational populations, constitutes the environment within 
which a new organizational form (i.e. a new population) becomes established. 
If existing organizations are stable, in botk their forms and their relationships 
to one another, they will tend not to exoloit any new resources that may 
become available in the environment at large. Thus new spaces open. Hannan 
(1986) and Hannan & Freeman (1986, 1982) emphasized that the fundamental 
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"branching" occurs in the environment. New organizations, expressing new 
organizational forms, will arise to exploit the resources. 

Some theorists have considered conditions that might cause an environment 
to branch and in so doing create a resource space for a new organizational 
form. Astley (1985) emphasized technological innovation as the crucial 
space-creating variable. Romanelli (19892) argued that virtually any event or 
development that could fundamentally alter existing flows of resources—e.g. 
changes in social values, the discovery or depletion of natural resources, 
changes in the demography of a human population, economic growth or 
decline, and so on—can effect a change in organizational resource space. 
Moreover, she suggested that existing arrangements of organizations and 
populations will influence characteristics of the emerging resource space. 
Even if firms in existing populations do not adapt to changing environmental 
conditions, they will be differentially likely to tap information regarding the 
change; thus individuals will have differential access to information based on 
their employment. Moreover, members of these organizations will differ in 
their abilities to attract resources for the innovation of new organizational 
forms. Thus, the *opening" (Astley 1985) of new resource spaces may depend 
on the dynamic interaction of exogenous change and existing constraining 
conditions. 

Little empirical work has been done to support or add detail to the specia- 
tion perspective. Marrett's (1980) description of the formation of women's 
medical societies showed that they tended to be founded by women doctors 
who possessed more and stronger ties to these organizations, usually via 
association with male members of the organizations. Aldrich & Zimmer 
(1986) emphasized the importance of entrepreneurial networks in facilitating 
resource and information combinations. Astley & Fombrun (1983) described 
the relationship of various populations in the communications industry—e.g. 
motion picture production, television broadcasting, publishing—to the video- 
tape production and distribution resource space. Detailed studies of the 
organizational origins of new organizations are needed to validate and extend 
the arguments of this perspective. 

There are two overlapping but competing arguments that characterize these 
three perspectives on how environments constrain the rates and kinds of 
organizational forms that are invented. One perspective, articulated by Han- 
nan & Freeman (1986, 1989), emphasizes the creation of a new resource 
space in the environment as primary to the invention of a new organizational 
form. Both creative destruction and environmental imprinting reflect this 
view, although the perspectives differ substantially in their arguments about 
how new spaces are created. Ecologists (e.g. Hannan & Freeman 1989) 
assume a quite direct relationship between characteristics of the new resource 
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space and the nature of organizational form that will arise. A very different 
understanding is represented in the idea that organizations and populations are 
producers of new organizational forms. While some relatively “free” environ- 
ment is also assumed in this perspective, the organization-as-producer hy- 
pothesis emphasizes the differential distribution of information over existing 
firms and populations as the critical envircnmental variable that determines 
the particular innovation in organizational ferm that will arise. The character- 
istics and information-gathering proclivities of existing organizations exert a 
substantial influence on whether a new environmental space is created or even 
perceived. 

It is somewhat tempting to suggest that these two views are compatible by 
saying that the environmental branching hypothesis identifies where the 
opportunities for an organizational form innovation will exist while the 
organizational speciation hypothesis expla ns the processes that realize the 
opportunities. Integrations of this sort, however, tend to obscure real theoreti- 
cal differences. At this early stage of research and theory on the evolution of 
new organizational forms, it seems most appropriate to allow all differences, 
i.e. all diversity, to remain undiluted. 


EMERGENT SOCIAL SYSTEMS 


The final perspective on the evolution of new organizational forms avoids 
grappling with the question of specific antecedent conditions—i.e. organiza- 
tional genes, environmental conditions, or organizational spawning 
grounds—almost altogether. Van de Ven & Garud (1989) draw on Etzioni 
(1963) to present an “accumulation” or “epigenetic” theory of the evolution of 
new organizational forms. As they argue, rew "environmental niches do not 
pre-exist . . . they are socially constructed comains through the opportunistic 
and collective efforts of interdependent actors in common pursuit of a tech- 
nological innovation" (1989:205). This perspective views variations in orga- 
nizational forms as arising dynamically throagh the cumulative interactions of 
entrepreneurs and organizations toward the establishment of a new industry 
system. 

The process begins with the largely independent activities of entrepreneurs 
who perceive an opportunity for creating a mew organization and who begin to 
accumulate the social and material resources that are necessary to build the 
business. Over time, as the independent eatrepreneurs seek resources from 
similar sources and otherwise travel in the same circles (e.g. industry associa- 
tions, trade shows, academic conferences), their paths begin to intersect. 
Interdependencies are established that benefit the actors directly through 
sharing of information and resources, whick speeds the efforts of all entrepre- 
neurs, and through expressing an emergent legitimacy, i.e. strength in num- 
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bers. Van de Ven & Garud argued that these interdependencies help isolate 
the emerging organizational form from direct competitors, or others whose 
vested interests might be threatened, by reducing the needs of the new firms to 
draw resources from the existing organizations. Once isolated, the partici- 
pants themselves begin to compete over alternative technological paths. 
Different paths lead to different clusters of interdependencies with organiza- 
tions outside the focal group. Over time, a network of competing and 
cooperating organizations, i.e. an industry, emerges that is defined by orga- 
nizations' productive or service relationships to a technological innovation. 

Aldrich & Wiedenmayer (1989) support this perspective and linked levels 
and kinds of collective action to changes in population density. Especially for 
new organizational forms, denser populations result in more frequent contacts 
among members, which should enhance learning about the organizational 
form. Collective action can lower entry barriers for would-be entrepreneurs, 
thus increasing the size of the populations and potentially the legitimacy of the 
organizational form. Collective action can also dampen competitive rivalry 
through institutionalizing, perhaps via trade associations, certain competitive 
and employment practices. 

Like most other perspectives on the evolution of organization forms, this 
perspective has yet received only a little empirical attention. Rappa (1989) 
described the development of the research community surrounding early work 
on compound semiconductors. His findings demonstrate how the dynamics of 
communication flows among researchers, and how changes in those com- 
munications flows, affect both the evolution of the technology and the 
development of an industry to support and commercialize the research. In a 
later study, Rappa & Debackere (1990) provided a detailed analysis of the 
very early evolution of a research community surrounding neural network 
technologies. This study emphasized flows of researchers into and out of the 
neural network community over time. The authors described how resources 
are "bootlegged" from laboratories devoted to other technological pursuits. 
They identified core group members who sustain the community through 
cycles of enthusiasm (the "bandwagon" effect) and despair. Finally, they 
described the eventual emergent "grapevine" that developed over public and 
private sectors and over different nations. Through these thick descriptions of 
the emergence of research communities, a view of industry evolution as a 
dynamic social process is gained; i.e. the evolution of a technological innova- 
tion is integrally related to the evolution of a new organizational form. 
Technology is not simply the necessary antecedent as typically assumed by 
most organizational researchers. 

Garud & Van de Ven (1989) examined the cochlear implants industry to 
describe isolating processes that carry a research community into the realm of 
economic enterprise. Initially, as a critical mass of individuals who believed 
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in the superior benefits of the cochlear ir»lant technology identified them- 
selves, special conferences and training programs emerged to facilitate the 
sharing of information, which thus sped development of the technology 
overall. These activities helped achieve tte endorsement of the American 
Medical Association. Then, to further estaEdsh the distinctiveness of cochlear 
implants as alternative technology, special standards of evaluation and sepa- 
rate panels of evaluators were established -ithin the United States Food and 
Drug Administration and the American Speech and Hearing Association. In 
other words, the separation of cochlear implant technology from other hear- 
ing-aid technologies was formally and str-cturally institutionalized. 

The practical implications of this perspe=tive are twofold. First, according 
to this perspective, technological innovat on may not be taken as a given 
incident around which new forms of orzanization evolve. Technological 
innovation is a dynamic social process ~hich, as it unfolds, creates the 
resource space that will support the new fms reflecting new organizational 
forms. In other words, in the emergent and initially dispersed research 
communities that form around possibilitiez for innovation, we may find the 
processes of evolutionary branching or speciation that other perspectives take 
as a given or, for some, even a random event. 

Second, neither populations nor organizetions can be used as the sole focus 
of empirical investigation. In contrast tc the organizational genetics and 
environmental conditioning perspectives, -esearch in this emergent social 
processes vein must identify, at least initia y, the human networks that enact 
the evolution of a new organizational form. Organizations, and the pop- 
ulations and communities in which they compete, may present a constraining 
context for the emergence of new technolozical enterprise. According to this 
social systems perspective, however, the c-ntexts are not directly productive 
of any variation. The context merely is the resource pool from which in- 
dividuals and their interactions create nev- organizational forms. 


THE EVOLUTION OF A NEW CRGANIZATIONAL 
THEORY 


If there is anything clear from this review af perspectives on the evolution of 
new organizational forms, it should be that a great number of theoretical 
arguments have been recently formulated. 3Vell over half of the citations in 
this review are to papers and chapters publimed within the last decade. This is 
not surprising in that much of the impetus >r exploring the evolution of new 
organizational forms derives from research and theory in organizational ecol- 
ogy and institutional sociology. Ecology ead at least recent formulations of 
the institutional perspective (e.g., Meyer & Rowan 1977, Zucker 1977, 
Meyer, Scott & Deal, 1983) are themselves fairly new perspectives on the 
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ongoing evolution of organizational populations. An even more striking 
statistic, however, is that virtually all of the citations regarding the evolution 
of new organizational forms specifically are to papers and chapters published 
during the last five years. 

It appears that, as a subject for study, the evolution of new organizational 
forms is undergoing its own speciation process. I will not press this argument 
by trying to identify any combining theoretical genes, though one might note 
the variety of disciplines that are intermingling in these theoretical dis- 
cussions. It does seem possible and instructive, though, to consider some 
branchings in the intellectual environment. 

Three areas of traditional thinking about organizational evolution are not 
represented in these current theories. First, the phrase "life cycle," whether in 
reference to organizations or environments, has not appeared in this review. 
Its absence was not intentional. It is simply the case that these theories of the 
evolution of organizational forms do not assume any developmental stages in 
either the conditions or activities of evolutionary change. O'Rand & Krecker 
(1990:256) pointed out that use of the term life cycle with respect to ecologi- 
cal studies of organizational birth and change evokes "only the broadest 
outlines of the idea of life cycle." Almost no attention has been paid to the 
timing, sequencing, or processes of maturation, although these are the central 
concern of most life-cycle theories. Second, there is only a hint of concern in 
these theories for explaining the "progress" of a society. The social and 
economic well-being of a society is understood as a product of evolutionary 
changes in organizational forms. Nothing in these theories, however, suggests 
that evolution will be synonymous with progress or advancement. Organiza- 
tional diversity is a variable, and the same evolutionary processes may 
account for its increase and decline. 

Final, the debate about whether existing organizations can adapt to 
changing environmental conditions, though not obviated by theories of the 
evolution of organizational forms, is properly relegated to a different category 
of theoretical and empirical concern. While some of the above theories do 
assume that new organizational forms will be invented, more typically by new 
versus established organizations, nearly all allow that the vehicles of evolu- 
tion may differ under different environmental conditions. As evolutionary 
theories explore processes that engender the innovation of new organizational 
forms, new insight will be gained regarding the role of organizational change 
and inertia. 

Intellectual branchings can be identified whenever a particular new theory 
is presented. The striking similarity on the above points of so many different 
theoretical arguments, however, suggests the occurrence of a fundamental 
change. Over the last decade, longitudinal and dynamic analyses of organiza- 
tions and populations have come to dominate empirical work in organizational 


100 ROMANELLI 
sociology. It should not be surprising that rw theory is evolving to explain 
the evolution of new organizational forms 


CONCLUSIONS 


In constructing this review, I have kept ir mind three basic goals. First, I 
sought to describe the various recent theoriesthat have been offered to explain 
the evolution of new organizational forms. =econd, I wanted to organize that 
literature along lines that would identify sone basic differences in perspec- 
tive, but not overly suggest homogeneity c^ arguments within perspectives. 
Because all theories of genesis must postulat» some final origins of variation, 
I grouped these writings according to their riews on the principal sources of 
new organizational forms. Finally, as my -eading of this literature has re- 
vealed the great variety of approaches to urderstanding the evolution of new 
organizational forms, I have tried to reflect tzat diversity. I see no overarching 
themes for integrating the perspectives. At -iis stage of our understanding, it 
seems that the most insights will be gained bz emphasizing differences so that 
the promise of this diversity may be realizec Of course, the challenge now to 
proponents of all these theories is empirical research. 
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Abstract 


This article examines the main theoretical contributions of critical theory, 
poststructuralism and postmodernism. It is argued that these three theories 
offer related perspectives on the shortcomings of positivism as well as new 
ways to theorize and study contemporary societies. Empirical and conceptual 
applications of these perspectives in sociological research are discussed. 
Some of these applications include work in the sociology of deviance, gender, 
media and culture. Finally, implications of these three theoretical perspectives 
for the ways sociologists think about the boundaries and territoriality of their 
discipline are discussed. 


SOCIOLOGY MEETS GERMAN CRITICAL THEORY 
AND NEW FRENCH THEORY 


Critical theory, poststructuralism, and postmodernism are intellectual tradi- 
tions most familiar to people who work in philosophy, aesthetic theory, 
literary criticism, and women's studies. Yet a number of American sociolo- 
gists are beginning to show productive familiarity with the three theoretical 
schools discussed in this paper (e.g. Lemert 1980, Lemert & Gillan 1982, 
Gottdiener 1985, Gottdiener & Lagopoulos 1986, Denzin 1986, 1989, 1990, 
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1991, Brown 1987, Richardson 1988, 199Da,b,c, 1991, Agger 1989a,b,c, 
1990, 1991a,b, 1992, Hazelrigg 1989, Antonio & Kellner 1991). Outside of 
sociology, the interpretive literature on critical theory, poststructuralism, and 
postmodernism is huge and growing (e.g. Jay 1973, 1984a,b, Eagleton 1976, 
1983, 1985, Held 1980, Schoolman 1989, Culler 1982, Weedon 1987, 
Aronowitz 1988, 1990, Harvey 1989, Luxe 1989, 1990, Best & Kellner 
1990). Any humanities-oriented booksto-e is bursting with exegetical 
volumes on Derrida's deconstruction, the Frankfurt School, poststructural- 
feminist film criticism, French feminism, and cultural studies. Derrida has 
become virtually a discipline in his own rigat, not least because he writes so 
densely and allusively (also because of his enormous intellectual charisma; 
Lamont (1987) has addressed the phenomenon of Derridean deconstruction 
sociologically). 

Although the three theories discussed in this paper are inherently and 
sometimes vigorously political, they are often ignored by empiricists not 
because they are leftist (after all, a good deal of the mainstream stratification 
and gender work published in American Sociological Review is vaguely 
leftish) but because they are incredibly, extravagantly convoluted—to the 
point of disastrous absurdity one would think, if reading Derrida’s (1987) 
Glas or Lyotard’s (1989) The Differend (n> typographical error that!). One 
cannot help but wonder why these theorists do not write more clearly and in 
ways that show the empirical (political, cultural, existential) relevance of their 
work more directly. I intend this paper largely as translation, explication, and 
application. As I argue below, the three theories are most relevant for the 
methodological and empirical work they ccn do, even if this is buried deep 
beneath the surface of these writings. 

I begin by developing the main ideas of each of the three theories. I then 
summarize the relevance of these ideas for methodology, research, and 
concept formation in mainstream sociolcgy. Above all, critical theory, 
poststructuralism, and postmodernism are effective as critiques of positivism 
(Stockman 1984), interrogating taken-for-granted assumptions about the ways 
in which people write and read science. They also make potentially useful 
substantive contributions. Although most American sociologists are not wed- 
ded to positivist doctrine, the research and writing they do tend to embody the 
central positivist tenet that it is possible to reflect the world without pre- 
suppositions, without intruding philosophical and theoretical assumptions into 
one’s work. All three theoretical perspectives discussed here reject pre- 
suppositionless representation, arguing explicitly that such representation is 
both politically undesirable and philosophically impossible. 

Although all three theories mount an exnaustive frontal attack on positiv- 
ism, they hold open the possibility of an empirical social science, albeit one 
that operates with decidedly nonpositivist assumptions (see Diesing 1991). 
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Neither poststructuralism nor postmodernism has produced a concrete version 
of this social science. However, during the exodus of the Frankfurt School to 
the United States during and just after World War II the German critical 
theorists (see Adorno 1969, Arato & Gebhardt 1978) did important empirical 
studies that adapted critical theory to the project of empirical social science, 
including but not limited to Adorno et al's (1950) study of the authoritarian 
personality (also see Adorno 1945, 1954, 1974). This work anticipates subse- 
quent applications and adaptations of these three theories to mainstream 
sociology. 


CRITICAL THEORY: MAIN IDEAS 


Critical theory is associated with the Institute for Social Research, established 
in Germany in 1923 and staffed by Theodor W. Adorno, Max Horkheimer, 
Herbert Marcuse, Friedrich Pollock, Leo Lowenthal, and Walter Benjamin 
(see Jay 1973, Hughes 1975, Kellner 1989b). The most important recent 
representative of critical theory is Jurgen Habermas, a student of Adorno and 
Horkheimer who departed significantly from certain positions of the founders 
(see Habermas 1970, 1971, 1975, 1979, 1981a,b, 1984, 1987a,b). For 
representative studies of the origins and meaning of critical theory, see Jay 
1973, Agger 1979, Connerton 1980, Kellner 1989b. Also see Slater 1977, 
who offers an orthodox-Marxist appraisal of the Frankfurt School. 

Critical theory as developed by the original Frankfurt School attempted to 
explain why the socialist revolution prophesied by Marx in the mid-nineteenth 
century did. not occur as expected. Marcuse, Adorno, and Horkheimer 
thought that they had to reconstruct the logic and method of Marxism in order 
to develop a Marxism relevant to emerging twentieth-century capitalism. 
They did not believe that they were recanting Marx's basic understanding of 
capitalism as a self-contradictory social system—e.g. see Horkheimer's 1937 
(1972) essay on this issue, "Traditional and Critical Theory," in which he 
spelled out the basic parameters of critical theory. In particular, the Frankfurt 
School theorists, following the Hungarian Marxist Georg Lukacs (1971), 
attempted to link economic with cultural and ideological analysis in explain- 
ing why the revolution expected by Marx did not occur. Like Lukacs (who 
used the term reification to refer to deepened alienation in an emerging “late” 
capitalism), the Frankfurt theorists believed that Marx underestimated the 
extent to which workers' (and others") false consciousness could be exploited 
to keep the social and economic system running smoothly. Lukacs and 
the Frankfurt thinkers agreed with Marx that capitalism over time tends to 
develop internal economic irrationalities (e.g. the concentration and cen- 
tralization of productive wealth at the expense of workers who are thrown 


108 AGGER 


out of work as a result and thus cannot consume the commodities that their 
labor produces). The Frankfurt School thouzht that capitalism in the twentieth 
century was beginning to develop effective coping mechanisms which 
allowed it to forestall the cataclysmic erupion of these periodic crises into a 
wholesale socialist revolution. 

In particular, capitalism deepens false ccnsciousness, suggesting to people 
that the existing social system is both inevitable and rational. Marx (nd:76— 
88) already provided for the possibility of false consciousness in his famous 
analysis of commodity fetishism in Volume One of Capital. According to 
him, commodity fetishism (typically misurderstood to mean people's obses- 
sion with commodity consumption—consumerism) refers to the way in which 
the labor process is mystified, appearing no: to be a purposeful construction of 
willful human beings. The particular character of false consciousness in a 
society founded on commodity fetishisn:—capitalism—3is the inability to 
experience and recognize social relations zs historical accomplishments that 
can be transformed. Instead, people “false y" experience their lives as prod- 
ucts of a certain unchangeable social nature. 

The deepening of commodity fetishism leads to what Lukacs called reifica- 
tion and the Frankfurt theorists domination. Domination in Frankfurt ter- 
minology is a combination of external exploitation (e.g. the extraction of 
workers' surplus value—explored exhausttvely in Capital) and internal self- 
disciplining that allows external exploitation to go unchecked. In sociological 
terms, people internalize certain values and norms that induce them to partici- 
pate effectively in the division of productive and reproductive labor. Classical 
non-Marxist social theory (Comte, Durkheim, Weber, Parsons and now the 
neo-Parsonians) explores what Parsons (1637; see O'Neill 1972a) called the 
Hobbesian problem of order: Why do people obey in organized industrial 
societies? The Durkheimian-Weberian-Parsonian answer is that people obey 
because they share certain common valves and beliefs (e.g. Durkheim's 
collective consciousness) that explain the world to them in a rational way. In 
particular, people believe that they can achieve modest personal betterment by 
complying with social norms but that largz-scale social changes beyond this 
are impossible. 

The Frankfurt thinkers argued that these common values inculcating obedi- 
ence and discipline contradict people's ob’ective interest in liberation. These 
values function ideologically to foreshorten people's imagining of what is 
really possible in an advanced technological society. Marcuse (1955) argues 
that domination must be redoubled in late capitalism in order to divert people 
from the increasingly realistic prospect of an end to scarcity and hence toil. 
What he (Marcuse 1955:32—34) calls surplus repression imposes discipline 
from the inside, inducing people to keep their noses to the grindstone, have 
families, and engage in busy consumerism. People are taught to fulfill their 
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needs through repressive desublimation, exchanging substantive sociopoliti- 
cal and economic liberties for the “freedoms” of consumer choice so abundant 
today (see Marcuse 1964:4—6). 

The Frankfurt thinkers explained the surprising survival of capitalism in 
terms of deepened ideologies—domination, in Frankfurt terms. In particular, 
they target positivism as the most effective new form of capitalist ideology. In 
the 1940s Horkheimer & Adorno (1972) in Dialectic of Enlightenment trace 
this new ideology all the way back to the Enlightenment. Although they 
support the Enlightenment's effort to demystify religion and mythology, the 
particular model of enlightenment grounded in positive science was in- 
sufficient to banish mythology once and for all. They argue instead that the 
positivist theory of science has become a new mythology and ideology in the 
sense that it fails to understand its own investment in the status quo. They do 
more, however, than contest positivism as a theory of scientific investigation: 
They argue that positivism has become the most dominant form of ideology in 
late capitalism in the sense that people everywhere are taught to accept the 
world “as it is," thus unthinkingly perpetuating it. 

Horkheimer and Adorno, like Marcuse (1964), reject positivism as a 
worldview of adjustment. Positivism suggests that one can perceive the world 
without making assumptions about the nature of the phenomena under in- 
vestigation. Its notion that knowledge can simply reflect the world leads to the 
uncritical identification of reality and rationality: One experiences the world 
as rational and necessary, thus deflating attempts to change it. Instead, the 
critical theorists attempt to develop a mode of consciousness and cognition 
that breaks the identity of reality and rationality, viewing social facts not as 
inevitable constraints on human freedom (as they were for Durkheim 
(1950:1—13)— social facts) but as pieces of history that can be changed. 
Dialectical imagination (Jay 1973) is the ability to view the world in terms of 
its potential for being changed in the future, a hard-won ability in a world that 
promotes positivist habits of mind acquiescing to the status quo. 

Positivism functions ideologically where it promotes passivity and fatal- 
ism. Critical theory targets positivism both on the level of everyday life and in 
social theories that reduce the social world to patterns of cause and effect. In 
this sense, a good deal of bourgeois social science comes under sharp attack 
by the Frankfurt School (e.g. Institute for Social Research 1972) for lacking 
the sort of dialectical imagination that enables social scientists to look beyond 
the appearance of given social facts toward (and as a way of achieving) new 
social facts—the end of class society, patriarchy, racism, and the domination 
of nature. Even Marxism has become too positivist, according to the Frank- 
furt School, where it has portrayed the downfall of capitalism as inevitable 
according to what Marx called economic "laws of motion." Whether Marx 
himself was a positivist is difficult to determine, given the range of his 
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expressions on epistemology (e.g. Marx 19:1). What is certain is that Marx 
was very much a child of the Enlightenment. who believed that science could 
conquer uncertainty and thus bring about a better world. What is also certain 
is that Marxists after Marx (especially those who dominated the Second and 
Third, or Communist, Internationals) recon=tructed Marx's more dialectical 
social theory along the line of positivist materialism (see Lichtheim 1961, 
Agger 1979). This began with Marx’s close collaborator, Friedrich Engels 
(1935), who.inaugurated a tradition that gatrered momentum (see Lenin 1952 
and Stalin 1940) until the death of Stalin amd until Marxists in the west had 
access to Marx’s (1961) early Economic. ana Philosophical Manuscripts (see 
Marcuse 1973) in which he articulated a nendeterminist historical material- 
ism. 

Frankfurt critical theory positioned itself against positivism of all kinds, 
notably the Marxist variety. For his part, H=bermas (1971), more decisively 
than his earlier Frankfurt colleagues, founc positivism writ large in Marx’s 
own oeuvre. Habermas argued that Mar» failed to distinguish carefully 
enough between knowledge gained from causal analysis and knowledge 
gained from self-reflection and interaction. 7s a result, Marxism has not been 
able to secure an adequate ground in voluntarism, instead falling back on the 
fatalism of positivist determinism. Haberras disagrees with Horkheimer, 
Adorno, and Marcuse that Marx was acteally an opponent of positivism 
himself. As a result, he argues, we must ~ork even harder to reconstruct 
Marx’s historical materialism in a way that gives more credence than Marx 
did to the categorical difference between knowledge gained from self- 
reflection and knowledge gained from causal analysis and technique. For 
Habermas, this reconstructed historical materialism has taken the form of his 
(1984, 1987b) communication theory, in vnich he attempts to shift critical 
social theory, like all western philosophy, from what he calls the paradigm of 
consciousness to the paradigm of commumecation, thus enabling workable 
strategies of ideology-critique, community building, and social-movement 
formation to be developed. 

Habermas’ reconstruction of critical theo-y has been especially compelling 
for critical social theorists because he has mastered and integrated a wide 
range of theoretical and empirical insighrs, all the way from traditional 
Marxism and psychoanalysis to Parsoniem functionalism and speech-act 
theory (see McCarthy 1978). Habermas has 3elped legitimize German critical 
theory in the university through his enormoss erudition and his willingness to 
engage with diverse theoretical and polit zal traditions (where his earlier 
Frankfurt cohorts were much more dismisz.ve of bourgeois philosophy and 
social science because of their accommoda- onism). Some (Agger 1976; but 
see Wellmer 1976, Benhabib 1987) argue ‘iat Habermas has seriously trun- 
cated the emancipatory agenda by drawing a heavy line between’ self- 
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reflection/communication and causality/technical rationality. One upshot of 
Habermas's categorical distinction has been to limit agendas of social change 
to the realm of self-reflection/communication, in which people rationally 
discuss alternative social policies and attempt to build consensus about them. 
His Frankfurt colleagues, like the early Marx, wanted to change not only 
deliberative policy processes but also the social organization of science and 
technology. Habermas (1971) rejects Marcuse, Adorno, and Horkheimer's 
view that we can change not only social policy but our whole technological 
interaction with nature. Habermas (1971:32—33) calls this view “a heritage of 
mysticism." His resulting critical communication theory is closer to the 
parliamentary social democracy of Eduard Bernstein (1961) and, later, the 
Scandinavians, than it is to traditional Marxist concepts of class struggle. 

That is not a sufficient reason in itself to reject Habermas’s reconstruction 
of historical materialism, especially at a time when leftist certainties about the 
inevitability of socialism's triumph over capitalism are being severely tested. 
One might reasonably respond that Soviet statism since Lenin (1973, nd) 
never resembled the mature socialism or communism advocated by Marx. 
And one might also acknowledge that Habermas's (1981b) "new social 
movements" theory is a fruitful empirical as well as political contribution to 
an ossified Marxism that excludes consideration of aspects of domination 
typically ignored by the white male left, notably domination based on sex and 
race. This is also a potentially significant substantive contribution to sociolo- 
y, along with the other Frankfurt contributions in the realms of state theory 
and cultural analysis. 

The most lasting methodological contribution of critical theory to social 
Science is the way it attunes empirical social researchers to the assumptions 
underlying their own busy empiricism. Sometimes, as Horkheimer & Adorno 
(1972) indicate in Dialectic of Enlightenment, the seeming avoidance of 
values is the strongest value commitment of all, exempting one's empirical 
claims from rigorous self-reflection and self-criticism. It is in this sense that 
the Frankfurt School's analysis of mythology and ideology can be applied to a 
positivist social science that purports to transcend myth and value but, in its 
own methodological obsessions, is mythological to the very core. 


POSTSTRUCTURALISM: MAIN IDEAS 


A brief discussion of the main ideas of poststructuralism assumes that we can 
cleanly separate poststructuralism from postmodernism. Unfortunately, we 
cannot. Primers (e.g. Culler 1982, Sarup 1989, Best & Kellner 1990) on the 
subject(s) cut the theoretical pie in any number of ways: Although most agree 
that Derrida is a poststructuralist (even though he does not identify himself as 
such), Foucault, Barthes, and Lyotard can be claimed by either camp and 
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often are. And the French feminists (Kristzva 1980, Irigaray 1985, Cixous 
1986) are sometimes viewed as proponents <f poststructuralism (e.g. Weedon 
1987). The lack of clear definition reflects te purposeful elusiveness of work 
that can be variously classified as poststructural and/or postmodern: Perhaps 
the most important hallmark of all this worz is its aversion to clean positivist 
definitions and categories. For Derrida (1975, 1978, 1981, 1987), the leading 
poststructural writer, every definition "decocstructs" itself—that is, it tends to 
unravel when one probes deeper into its fou-dational assumptions and literary 
gestures (but see Fraser 1984). 

There is substantial overlap between post=tructuralism and postmodernism. 
For my purposes here, poststructuralism (Gerrida, the French feminists) is a 
theory of knowledge and language, whereas postmodernism (Foucault, 
Barthes, Lyotard, Baudrillard) is a theory of society, culture, and history. 
Derrida's influence in the realms of literary criticism, literary theory, and 
cultural analysis has been substantial (Bermen 1988). Literary critics prise out 
of Derrida a methodology of textual read:ng called deconstruction (Culler 
1982). This deconstructive method has sprezd like wildfire through American 
humanities departments, offering a serious caallenge to traditional literary and 
cultural criticism dominated by textual objectivism (e.g. see Ransom 1941 on 
the New Criticism; also see Fekete 1978 and Lentricchia 1980). 

Although Derrida does not elaborate a siazle deconstructive method, refus- 
ing programmatism in favor of his own zxemplary literary, cultural, and 
philosophical readings, it is easy to see that iterary deconstruction challenges 
traditional assumptions about how we read and write (Fischer 1985). Indeed, 
some of these deconstructive insights have begun to fertilize social-science 
disciplines (e.g. in anthropology, Marcus & Fischer 1986; in sociology, 
Lemert 1980, Brown 1987, Agger 1989c), especially with regard to the 
examination of cultural works and practices. I have argued for a blending of 
poststructuralism and critical theory that tredes heavily on Derrida's model of 
textual analysis (see Callinicos 1985, Agg=r 1989a, Poster 1989). Derrida’s 
insights into reading and writing disqualiiy the positivist model of a re- 
searcher who simply reflects the world “owt there," suggesting new ways of 
writing and reading science. 

Derrida insists that every text is undeciaable in the sense that it conceals 
conflicts within it between different authorzal voices—sometimes termed the 
text and subtext(s). Every text is a contested terrain in the sense that what it 
appears to “say” on the surface cannot be uaderstood without reference to the 
concealments and contextualizations of meaning going on simultaneously to 
mark the text's significance (e.g. the we of specialized jargon). These 
concealments and contextualizations might 5e viewed as the assumptions that 
every text makes in presuming that it will be understood. But these assump- 
tions are suppressed, and thus the reader's attention is diverted from them. A 
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sociological example is pertinent here: Where the status-attainment research- 
ers of the Blau-Duncan (Blau & Duncan 1978) tradition defined mobility with 
respect to the occupational status of one's father, a deconstructive reading 
would reveal the profound assumptions about the gendered nature of work as 
well as about male supremacy that underlie this methodological choice. More 
recent feminist scholars (e.g. Bose 1985) challenge the operationalization of 
occupational status in terms of father's occupation because, they argue, this 
represents a powerfully ideologizing subtext.that (a) leads people to think that 
only men work, or should work, and (b) misrepresents reality where, in fact, 
women work outside the home for wages. 

Feminist deconstruction of this aspect of status-attainment work shows, in 
Derrida's terms, that the operationalization of occupational status is “undecid- 
able" in the sense that it engages in certain exclusions that imperil its own 
claim to fixed and final meaning. There is no univocal or unchallengeable 
measure of occupational status; there are only competing versions, each of 
which is incomplete because it engages in certain exclusions. For Derrida, 
deconstructive reading prises open inevitable, unavoidable gaps of meaning 
that readers fill with their own interpolative sense. In this way, reading is a 
strong activity, not merely passive reflection of an objective text with singular 
meaning. Readers help give writing its sense by filling in these gaps and 
conflicts of meaning, even becoming writers and hence challenging the 
hierarchy of writing over reading, cultural production over cultural reception. 

Derrida's notion of undecidability rests on his notions of difference and 
differance. Essentially, he argues that it is in the nature of language to 
produce meaning only with reference to other meanings against which it takes 
on its own significance. Thus, we can never establish stable meanings by 
attempting correspondence between language and.the world addressed by 
language. Instead, meaning is a result of the differential significances that we 
attach to words. Thus, for example, Weber's notion of "status" acquires 
meaning with reference to his concept of "class," not in terms of a fixed 
reality that his word "status" supposedly reflects. Derrida plays on the French 
word differance to show that one cannot hope to arrive at a fixed or transpar- 
ent meaning as long as one uses a necessarily deferring as well as differing 
language: Every definition and clarification needs to be defined and clarified 
in turn; meaning always lies elusively in the future. 

Word choice cannot do our thinking for us, nor solve major intellectual 
controversies. One is fated to improve on the undecidability (and sometimes 
sheer muddleheadedness) of language through more language, which creates 
its own problems of difference/differance and thus occasions its own de- 
construction (see Coward & Ellis 1977). Although (Agger 1989c:335—44) the 
best writing anticipates and acknowledges its own undecidability forthrightly, 
refusing the positivist pretense of transparency once and for all, Derrida is not 
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particularly concerned to strategize about how to write better (or, in the case 
of social science, to do better empirical work). He is more concerned to 
puncture the balloon of those who believe that language is simply a technical 
device for establishing singular, stable meenings instead of the deeply con- 
stitutional act that it is. In itself, this powerfully demystifies positivism by 
calling attention to positivism's own embecdedness in language (and, in the 
case of science, method). There is no royal road to meaning except through 
the meaning-constitutive practices of language that, in turn, provoke new 
confusions, contradictions, and conflicts. Derrida can be read as a gloomy 
relativist where he seems to despair of the possibility of enlightenment. He 
believes that we are destined to remain locked up in the prison house of 
language, as Nietzsche called it (see Jameson 1972). But the fact that Derrida 
bothers to write at all shows his convictioa that language can be clarified, 
even if we do this playfully, allusively and ironically. 

Derrida (e.g. 1976) would defend his owr density by arguing that difficulty 
educates. He would also say that simplicity brings false clarity, suppressing 
the difficulties of making oneself clear that are intrinsic to language's un- 
decidability. In this sense, Derrida joins the Frankfurt School's attack on 
positivism, albeit from a particularly linguistic and literary direction. Where 
the Frankfurt School argued that positivism wrongly exempts itself from its 
own critique of mythology and ideology (value-freedom being a value stance, 
after all), Derrida shows kow this works on -he level of rhetoric: One can read 
his oeuvre as a rhetorical analysis of what h= calls the philosophy of presence 
(another name for positivism) (see Hartman 1981). He shows how the process 
of differing/deferring works on the page, end underneath it, just as he also 
suggests his own work as an example of genuinely deconstructive reading that 
subverts the false simplicity and closure c.aimed by positivists. 

Derrida's relevance to social science is potentially enormous (Agger 
1989c). His poststructural notions of literary criticism suggest ways of read- 
ing and reformulating the densely technical and methodological discourses of 
the empirical social sciences. Must methodology mystify way out of propor- 
tion to its intellectual accomplishments as well as intrinsic difficulty? A 
Derridean would not only crack the code of densely technical and figural work 
characteristic of quantitative social science but would, in this, exemplify a 
more accessible mode of reading and writng. Methodology can be read as 
rhetoric, encoding certain assumptions and values about the social world. 
Deconstruction refuses to view methodology simply as a set of technical 
procedures with which to manipulate daca. Rather, methodology can be 
opened up to readers intrigued by its deep assumptions and its empirical 
findings but otherwise daunted by its densely technical and figural nature. 

To put this generically, deconstruction can help reveal the values and 
interests suppressed far beneath the surface of science. This politicizes and 
democratizes science by opening its text to outsiders, allowing them to engage 
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with science's surface rhetoric more capably as well as to contest science's 
deep assumptions where necessary (e.g. in the case of my previous example 
about how to operationalize occupational status). Science written from the 
perspective of deconstruction avoids overreliance on technical and figural 
gestures; instead it continually raises its assumptions to full view and thus 
invites readers to join or challenge them. Of course, a deconstructive science 
text will never solve all problems of opacity and undecidability; science no 
more than fiction can attain absolute truth, no matter how reflexive it is about 
its own values, assumptions, and methodological choices: Every deconstruc- 
tion can be deconstructed. 

Poststructuralism helps science readers and writers recognize their own 
literary involvements and investments in the text of science. No matter how 
seemingly insignificant, every rhetorical gesture of the text contributes to its 
overall meaning. How we arrange our footnotes, title our paper, describe our 
problem, establish the legitimacy of our topic through literature reviews, and 
use the gestures of quantitative method in presenting our results—all contrib- 
ute to the overall sense of the text. We can learn to read these gestures not 
simply as embellishing “subtext” but also as a central text in their own right, 
making an important contribution to the argument of science. We can also 
rewrite science by authorizing these seemingly marginal gestures, turning 
them into the discursive arguments they really are. Poststructuralism calls into 
question a variety of literary norms of empirical science, suggesting that we 
read science not as a mirror of the world but as a strong, imaginative, 
sometimes duplicitous literary intervention in its own right. Methodology tells 
a story in spite of itself: It can be read rhetorically and hence rewritten in less 
technically compulsive ways, both affording greater access and raising its 
encoded assumptions to view (see Richardson 1990c). 

For the most part, poststructuralists have concentrated on literary and 
cultural texts (although, as I am arguing, there is no necessary reason for 
doing so, given the susceptibility of science to deconstructive analysis). 
Derrida is averse to. science because science almost always claims an exemp- 
tion from the rule of undecidability; he equates positivism with all empirical 
Science. A certain prejudice against science exists on the part of deconstruc- 
tors, who reject all objective analysis, not only the falsely presuppositionless 
objectivism of positivism. This has made it somewhat difficult (if not impos- 
sible) for sociologists to recognize the potential sociological contributions of 
deconstruction. 


POSTMODERNISM: MAIN IDEAS 


More than Derrida’s poststructuralism, postmodernism, especially in the 
work of Foucault (1970, 1976, 1977, 1978, 1980), makes evident con- 
nections with mainstream social science, particularly in the realms of cultural 
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and discourse analysis, the sociology of social control and the study of 
sexuality. I discuss these contributions after I clarify some of the general 
tenets of postmodernism. 

Although postmodernism arguably arose as an architectural movement (see 
Portoghesi 1983, Jencks 1987) the most explicit philosophical postmodernist 
is Lyotard. His (1984) The Postmodern Condition: A Report on Knowledge 
represents the core of postmodern thinking on central issues of modernity and 
postmodernity (also see Newman 1985, Huyssen 1986, Hassan 1987, Feath- 
erstone 1988, Klinkowitz 1988, Harvey 1289, Sarup 1989, Best & Kellner 
1990, Turner 1990). Lyotard rejects totalizing perspectives on history and 
society, what he calls grand narratives like Marxism that attempt to explain 
the world in terms of patterned interrelationships. His postmodernism is an 
explicit rejection of the totalizing tendencies as well as political radicalism of 
Marxism. Like most postmodernists, Lyotard suspects Marxists of self- 
aggrandizing motives. He maintains that ore cannot tell large stories about the 
world but only small stories from the heterogeneous “subject positions" of 
individuals and plural social groups. Jameson (1972, 1976-1977, 1981, 
1984a; see Dowling 1984), an importart literary theorist who examines 
postmodernism through neo-Marxist lenses, suggests that postmodernism 
(“the cultural logic of late capitalism") is fundamentally conservative (Jame- 
son 1984b); Habermas (1981a, 19872) arg.1es that postmodernism is neocon- 
servative (also see Raulet 1984, Wolin 1984). I (Agger 1990) have split 
postmodernism into apologetic and critical versions, extending the angry 
modernist impulse of a politicized postmodernism (e.g. Huyssen 1986) 
toward a merger with critical theory. Typifying the majority of American 
affiliates of postmodernism, Kroker & Cook (1986) attempt to depoliticize 
postmodernism, viewing it simply as a cultural movement (or "scene") (see 
Gitlin 1988). 

A postmodern social theory (see BenRabib 1984, Kellner 1988) would 
examine the social world from the multiple perspectives of class, race, gender 
and other identifying group affiliations. At the same time, this social theory 
would refuse the totalizing claims of grand narratives like Marxism that 
attempt to identify axial structural princioles explaining all manner of dis- 
parate social phenomena (e.g. Marx's theory of the logic of capital). 
Postmodernism is antireductionist and pluralist, both in its causal priorities 
and in its politics, which are more liberal than radical (see Arac 1986). 
Postmodernism mistrusts radicals and racicalism, preferring the decentered 
knowledges available not only to a centra: knowledge commissar but also to 
people who engage with the world from the irreducible perspectives of their 
own experience. 

Foucault (1976, 1980) insists that kno.vledge must be traced to different 
discourselpractices that frame the knowledge formulated from within them. 
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Sociologists of science will recognize Foucault's view of discourse/practice as 
a version of Kuhn's (1970) paradigm notion, although Foucault makes more 
use of everyday experience and ordinary language to define the parameters of 
these paradigmatic knowledges. Foucault has made direct empirical contribu- 
tions to social science where he has studied the discourse/practices of prisons 
(1977) and sexuality (1978), offering rich and varied accounts of how these 
modes of knowledge and practice were constituted historically by way of the 
discourses through which they were made problematic. Although clearly 
influenced by Marxism, Foucault rejects Marxist class analysis for its simple 
dualities (see Poster 1989; but see Dews 1984, 1987 and Fraser 1989). 
Instead, he argues that potential power is to be found everywhere, in the lot.of 
the disenfranchised as well as with the wealthy (see O'Neill 1986). 

Like poststructuralism, postmodernism is profoundly mistrustful of social 
sciences that conceal their own investment in a particular view of the world. 
Like poststructuralism and critical theory, postmodernism rejects the possibil- 
ity of presuppositionless representation, instead arguing that every knowledge 
is contextualized by its historical and cultural nature. At some level, a 
universal social science is judged impossible because people's and groups' 
different subject positions cannot be measured against each other: For ex- 
ample, there is no way to adjudicate the issue of who is more oppressed— 
women or people of color. Instead, it is important to recognize how their 
differential experiences of the world are framed by the discourse/practices 
constituting the experience of being a woman or a person of color at a given 
historical moment. Social science becomes an accounting of social experience 
from these multiple perspectives of discourse/practice, rather than a larger 
cumulative enterprise committed to the inference of general principles of 
social structure and organization. 

Thus, like poststructuralism, postmodernism rejects the project of a univer- 
sal social science, falling back on the particular modes of knowledge defined 
by the multiplicity of people's subject positions. In many respects, this is 
highly reminiscent of social phenomenology and ethnomethodology (e.g. 
Schutz 1967, O'Neill 1974), both of which emphasize the irreducibility of 
experience and reject social-structural analysis. This should not be surprising 
because postmodernism, phenomenology, and ethnomethodology emerge 
from some of the same sources, notably the philosophies of Nietzsche and 
Heidegger, both of whom rejected the Enlightenment's attempt to create a 
universal knowledge. Although phenomenology and ethnomethodology are 
more methodologically inclined than postmodernism, Foucault clearly con- 
verges with Garfinkel (1967) and others (e.g. Cicourel 1973, Douglas 1981) 
in his tradition (see Mehan & Wood 1975). Their main difference is that 
ethnomethodology, unlike postmodernism, affiliated itself to the disciplinary 
project of sociology during and after the 1960s. Postmodernism remains a 
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largely French perspective on cultural and aistorical analysis that bears little 
explicit relationship to French or American sociology.. 

But while postmodernists tend to reject tæ project of science, a postmodern 
social science is possible, especially if ore extrapolates creatively from the 
work of Barthes (1970, 1974, 1975) who, like Foucault, suggests new ways 
to view the sociocultural world. For example, he (1975:92) argues that “the 
city is a discourse," suggesting that by reacang the city we can do useful social 
science, albeit of a type barely recognizaL e to positivist urban sociologists. 
Gottdiener (1990) and Gottdiener & Lagopoulos (1986) further develop a 
postmodern semiotics of urban life, showiæ the relevance of the postmodern 
project to empirical social science. (As . will discuss later, Gottdiener's 
postmodern sociology has drawn fire from establishment sociologists who 
judge the contribution of French theory t be scanty.) 

After Lyotard, Foucault, and Barthes the fourth major postmodernist 
thinker is Baudrillard (1975, 1981, 1983). who offers the most sociological 
version of postmodernism to date (see Kelher 1989a,c). In his early analysis 
of late capitalism, Baudrillard (1975, 1931) suggests that in a consumer 
society commodities acquire a certain sign -alue that people covet. Instead of 
consuming designer-labeled commodities or their use values (Guess jeans 
functioning as clothes and Honda cars as trensportation), people buy them for 
their sign value, a notion akin to Weber arx Veblen's notions of status value, 
albeit grounded in a dense semiotic theory nat builds on the work of Saussure 
(1960) and Eco (1979). In later work, Besdrillard (e.g. 1983) suggests that 
reality (he calls it hyperreality) is increasingly simulated for people, con- 
structed by powerful media and other cultu-al sources. People lose the ability 
to distinguish between these simulations and reality, a precondition of all 
social criticism. This analysis closely resembles the neo-Marxian Frankfurt 
analyses of false consciousness and suggests lines of research in the sociology 
of culture, media, and advertising. 


USING CRITICAL THEORY, PGSTSTRUCTURALISM, 
AND POSTMODERNISM: METHODOLOGY, RESEARCH, 
AND CONCEPT FORMATION 


I have already pointed to ways in which crtical theory, poststructuralism, and 
postmodernism are used, and can be usec. by empirical sociologists. There 
are also important ways in which these tracctions are inimical to the concept of 
a social science: They would transform me concept and practice of social 
science to such an extent that most sociol-gists would scarcely recognize it. 
Let me summarize the explicit contributzsns of these three theoretical per- 
spectives to methodology, empirical reszarch, and concept formation in 
sociology, before I conclude by offering some cautions about these integra- 
tions. 
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The sociological contributions of critical theory, poststructuralism, and 
postmodernism fall into two broad categories. They are methodologically 
relevant to the ways in which people write and read sociology (numbers 1—5, 
below). This mainly involves their critique of positivism. They are also 
relevant substantively in their various contributions to the study of the state, 
ideology, culture, discourse, social control, and social movements (numbers 
6—10, below). 


Methodological Implications 


1. Critical theory forces sociological empiricism to interrogate its own 
taken-for-granted exemption from the sullying interests of perspective, pas- 
sion, polemic, and politics. As Horkheimer & Adorno (1972) argue, positive 
Science is no less susceptible to mythification and mystification than is 
theology. The unquestioned belief in value-freedom is no less problematic 
than the belief in God or spirit. The researcher is perhaps even more vulner- 
able to self-serving self-deception where research is conducted with an 
obliviousness to the powerful forces of what Habermas (1971) calls interest as 
these frame and form the research act as well as the interpretation of findings. 
This is another way of saying that social science should be reflexive, Gould- 
ner's (1970; also see O'Neill 1972b) term for the studied self-reflection 
necessary to deflate the hubris of scientists about the unquestioned superiority 
of their methods over the nonsciences, from literature to philosophy. As 
Gouldner showed (heavily influenced by the Frankfurt School; see Gouldner 
1976), a nonreflexive sociology ignores its own contamination by political 
interests in preserving the status quo (see Boggs 1983). 

2. Critical theory contributes to the development of a postpositivist 
philosophy of science. Although the Vienna Circle's unreconstructed logical 
positivism has been defunct for decades, many working methodologists in the 
social sciences, especially sociology, are practicing positivists, even where 
they have not read systematically in the positivist philosophy of science. 
Habermas (1971) opposes the positivist dichotomy of knowledge and interest, 
arguing that the most valid science recognizes its own grounding in interest, 
hence controlling for the sullying effect of context on one's scientific text. 
The Frankfurt theorists argue that positivism is not only a flawed philosophy 
of science but also a flawed political theory that reproduces the status quo by 
encouraging conformity with alleged social and economic laws. In this sense, 
the Frankfurt theorists broaden the critique of positivism from epistemology 
per se to broader issues of political and social theory, hence overcoming the 
differentiation of epistemology from substantive social theory. In this sense, 
they help deconstruct methodology, showing that method, like the philosophy 
of science, is not simply a technical apparatus but a rhetorical means for 
concealing metaphysically and politically freighted arguments in the densely 
technical discourse/practice of quantitative analysis and figural gesture. Hork- 
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heimer & Adorno's (1972) argument against unthinking use of quantification 
and methodology in the social sciences is a Zontribution to this deconstruction 
of methodology, although, as I argue below, poststructuralism offers an 
equally fruitful critique of methodologism by showing how method can be 
read and hence rewritten as a passionate, perspectival, and political text in its 
own right. 

3. In this sense, poststructuralism corapletes the Frankfurt critique of 
science by showing that we can read a sorts of nondiscursive texts as 
rhetoric—arguments for a certain state of social being. By drawing attention 
to the subtexts of science's literary presentation (e.g. acknowledgments, 
citation practices, preliminary literature reviews, the use of number and 
figure, how discussion/conclusion sections of research articles are phrased, 
endnotes, footnotes, and appendices etc), poststructuralism helps read and 
hence democratize science. Methodology can be cracked open and laid bare to 
outsiders. It can also be written differently. less technically, without sacrific- 
ing important technical detail. Unfortunetely, few poststructuralists have 
attempted this deconstruction of methodolcgy, preferring to concentrate their 
critical attention on cultural and literary texs. But this is beginning to change, 
as the ethnographic sociology of science Latour & Woolgar 1979, Knorr- 
Cetina 1981, Gilbert & Mulkay 1984) is buttressed by this poststructural 
underpinning in discourse analysis (Agger 1989b, Luke 1991). 

4. Poststructuralism reveals how language itself helps constitute reality, 
thus offering new ways to read and wzite science. Its critique of pre- 
suppositionless representation—texts mirro-ing a world "out there"—suggests 
nonpositivist literary strategies for writers. who deconstruct their own work 
and thus heighten their reflexivity. This both filters out subjective contami- 
nants, and, where perspective cannot “and should not) be eliminated 
altogether, it raises an author's deep investments to full view and thus allows 
readers to enter dialogue with them. Habermas (1984, 1987b) argues for a 
universal speech situation governed by norms of dialogical equality and 
reciprocity wherein the goal of consensus formation guides many de- 
hierarchizing social practices. Where writers learn how to bring their own 
foundational assumptions to the surface, not concealing them underneath the 
methodological artifice of science (which counsels dispassion as well as 
technical solutions to substantive problems), they enhance democracy by 
opening science to public debate. 

Blending with critical theory, the poststractural critique of science suggests 
new sciences (Marcuse 1969) that are formulated in different terms. This 
is not simply an argument “for” first-person writing, as if that would solve 
all problems. The occasional intrusion of tne author in the text may only dis- 
guise a deeper commitment to positivist resresentation, in which the author's 
voice is filtered out after initial stage-setting prolegomena. The poststruc- 
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tural critique of science reauthorizes the science text where scientists have 
lost their own voices. In so doing, it challenges the authority of objectivist 
science, interrogating the most basic assumptions of mainstream sociology, 
notably the view that substantive analytical problems can be solved methodo- 
logically. The poststructural critique of science leads to new writing and 
reading practices: Writers excavate their own, often unconscious, pre- 
dilections, and readers learn to do this excavation where writing is couched in 
hardened objectivist prose. 

5. Postmodernism rejects the view that science can be spoken in a singular 
universal voice (e.g. Lyotard's (1984) critique of the grand metanarratives of 
western reason). Although this risks losing the global perspective of the 
Enlightenment (including Marxism), it enables readers to deconstruct the 
universal reason of the Enlightenment as the particularistic posture of Eu- 
rocentric rationality, which contains class, race, and gender biases. 
Postmodernism makes it possible to read universal reason as secret partisan- 
ship just as it suggests ways of detotalizing the voices of science more 
accurately to reflect the variety of so-called subject positions from which 
ordinary people can speak knowledgeably about the world. 

This has the advantage of challenging singular methodologies, whether 
quantitative or qualitative. It would seein to argue for multiple methodologies 
as well as multiple class, race, and gender perspectives on problems. This has 
the additional advantage of empowering a variety of heretofore muted speak- 
ers to join discussions about social issues, legitimating their noncredentialed 
interventions into the scientific field and deprivileging the mainstream posi- 
tivist voice. Postmodern and poststructural ethnographers (e.g. Marcus & 
Fischer 1986) are highly self-conscious about the ways in which their own 
narrative practices impose distorting interpretive frames on people's experi- 
ence. Although this has been a perennial concern of positivist ethnographers 
as well, the postmodern and poststructural attention to issues of discursive 
politics has significantly advanced the ways in which ethnography is com- 
posed (e.g. Richardson 1988, 1990a,b,c, Denzin 1990), especially among 
those who link discourse theory with larger sociopolitical questions of colo- 
nialism and imperialism (e.g. Said 1979). 

These five methodological contributions can be summarized by saying that 
critical theory, postmodernism, and poststructuralism attune working empiri- 
cists to the ways in which their own analytical and literary practices encode 
and conceal value positions that need to be brought to light. Although critical 
theory, stemming from Marxism, is decidedly the most political of these three 
perspectives, it is possible to forge links among critical theory, poststructural- 
ism, and postmodernism, as a number of scholars have begun to do (e.g. 
Ryan 1982, 1989, Smart 1983, Agger 1989a,c, 1990, Kellner 1989b, Arono- 
witz 1990). But even without this leftist political underpinning deriving from 
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one or another version of neo-Marxism, the theoretical critique of the encod- 
ing of value affects everyone who rejects the positivist posture of value- 
freedom, whether from the standpoints of liberalism or radicalism. 

The primary encoding of value in empir-zist social science occurs through 
an uncritical reliance on methodology as a -urely technical device for solving 
intellectual problems. Ironically, many er-pirical sociologists either ignore 
the philosophy of science altogether, as I nced above, or they explicitly reject 
positivism and claim that they are sens ive to Kuhnian concerns about 
framework and paradigm. Unfortunately, even a casual reading of main- 
stream sociology journals suggests that most empiricist sociologists privilege 
methodology over theory and concept formation, even if they disavow this in 
their methodology classes, where they espose graduate students to com- 
pulsory readings in Kuhn and other postpositivist historians, sociologists, and 
philosophers of science. Most empirical arzcles published in ASR and Social 
Forces rely on the rituals of methodology i- order to legitimate a certain form 
of knowledge. In these formulaic journal a—icles, methodology is not written 
or read as the perspectival text it is. Instead. the technical and figural gestures 
of quantitative method are used to suppress the deconstructive recognition of 
the undecidability of the arguments/analyses being presented. Whether or not 
the authors of these articles talk about causality explicitly, using the ver- 
nacular of positivism, virtually all of these =mpirical articles deploy method- 
ological techniques as a rhetorical device tc enhance the science aura (Agger 
1989b:70—72) of the text in question. 

The three theoretical perspectives discussed here help strip away the 
appearance of science's representationalizy in order to show the creative 
authorship underlying every gesture of the science text: We learn that science 
is a literary practice that could be done differently—more democratically and 
less technically. The real author undernezth the leaden objectifying prose 
attempts to cover his or her own footprirss lest the scientificity of the text 
come into question. Critical theory, postzructuralism, and postmodernism 
expose science's apparent authorlessness as one possible rhetorical stance 
among many. 

This helps demystify and democratize not only journal science but the 
whole institution of science, which Foucault (1977) astutely likens to the 
disciplining discourse/practice of the prison, in which surveillance comes 
from the inside as well as the outside. It also suggests new ways of writing 
science, exposing science’s authorial artifice directly in the body of the text 
and not suppressing it with the apparatus of methodology. This is not an 
argument against method (see Feyerabemd 1975) but an argument for the 
literary deconstruction and reconstruction of method as a persuasive, public 
text in its own right. We learn to use the dizcourse of method where it is really 
integral to our arguments, which we do not cast in positivist terms of pure 
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representation but which we recognize are undecidably subject to revision and 
improvement. 


Substantive Implications 


6. Critical theory (e.g. Neumann 1942, 1957, Horkheimer 1973, Haber- 
mas 1975) suggests new ways of theorizing the role of the state and culture in 
advanced capitalism. The state today intervenes in protecting capitalism 
against its own contradictory nature. Capitalism outlives Marx's expectation 
of its demise because the state massively intervenes to alleviate economic 
crisis and popular culture forestalls psychic crisis (see Kellner 1984—1985, 
1989b). The Frankfurt theorists contributed both theoretical and empirical 
analyses of state and cultural intervention, culminating in works like Horkhei- 
mer & Adorno's (1972) Dialectic of Enlightenment, Marcuse's (1964) One- 
Dimensional Man, and Habermas' (1984, 1987b) The Theory of Com- 
municative Action, probably the three most systematic statements of critical 
theory. 

The critical theorists do not abandon Marx’s analysis of the contradictory 
nature of the logic of capital. But Marx could not have foreseen the huge 
growth of the state and mass culture in late capitalism. These studies provoke 
important theoretical and empirical work on the state (Miliband 1974, Offe 
1984, 1985) as well as critical analyses of mass culture (Miller 1988, Luke 
1989). In this sense, Frankfurt critical theory joins more traditional Marxian 
and neo-Marxian economic theory (e.g. O'Connor 1973, Poulantzas 1973) 
as well as the cultural-studies traditions of the Birmingham School, 
poststructuralism, postmodernism, and feminist cultural studies (especially 
cinefeminism) (see Agger 19912). Although the Frankfurt School's political 
economy was not as reductionist as that of more economistic neo-Marxists 
(e.g. Althusser 1969) and although their approach to popular culture was 
more mandarin than that of other cultural-studies analysts (e.g. Marcuse 
1978, Adorno 1984), there are important empirical and theoretical con- 
vergences here that are relevant to sociologists of culture who view culture as 
a structural and hence political factor in late capitalism (see Johnson 1986— 
1987). 

7. Foucault' s postmodernism offers valuable insights to students of social 
control. His Discipline and Punish (1977) revolutionizes the study of crime 
and punishment, particularly in his argument that criminology is a discourse/ 
practice that in a sense creates the category of criminality. This category is 
then imposed punitively on behaviors that formerly were viewed as socially 
legitimate or simply ignored as bizarre. This analysis converges with labeling 
theory (e.g. Goffman 1961, 1974, Becker 1966), although it gives labeling 
theory a firmer historical and political foundation. Foucault helps sociologists 
view deviance in terms of the experiences and meanings that construct it. But 
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unlike labeling and social-control theories, Foucault's postmodern theory of 
discipline stresses the inherent resistances that people mount against their 
labeling and differential treatment. This is = theme that emerges very clearly 
in his (1978) work on The History of Sexu-lity, where he discusses ways in 
which women and homosexuals resist their societal disapprobàtion. Although 
Foucault is sometimes accused of having £ sloppy method, he makes up for 
this in his extraordinarily imaginative usc of historical and cultural data, 
which he assembles into a theory of somal control that neglects neither 
macrolevel nor microlevel phenomena. Thz postmodern study of social con- 
trol, inspired by Foucault, also has implicat_ons for research on organizations, 
as Cooper & Burrell (1988) have demons-ated. 

8. Derrida’s poststructuralism and Bcadrillard's postmodernism offer 
valuable contributions to the sociologica. study of discourses, potentially 
enriching a wide range of sociological subfields including the sociology of 
mass communication and media, the sociol-gy of knowledge, and the sociol- 
ogy of science. Derrida’s deconstructive przgram contributes substantively to 
the interpretation of cultural and linguiste forms. Using semiotic theory, 
Baudrillard decodes cultural images and w-rks for their sociopolitical mean- 
ings. His (1981) For a Critique of the PoF&ical Economy of the Sign moves 
the Marxist theory of culture a significa-t step beyond orthodox-Marxist 
cultural and aesthetic theory, arguing that Historical materialists now need to 
consider the relative autonomy of symbcic and cultural systems without 
giving up the traditional political-econom& focus of Marxism. Baudrillard, 
like the Frankfurt School (e.g. Horkheimer & Adorno 1972), gives cultural 
sociology a stronger theoretical foundatior as well as critical resources with 
which to decode cultural simulations as fase representations of reality. 

Critical theory and poststructuralism bot- generate deconstructive readings 
of cultural works and practices like televis:en (Kellner 1981, Best & Kellner 
1988a,b,), journalism (Hallin 1985, Rachlm 1988), and advertising (Kline & 
Leiss 1978, Williamson 1978, Wernick 1=83, Ewen 1976, Leiss, Kline & 
Jhally 1986) as literary works encoding powerful authorial claims about the 
social world. Denzin (1989) has read films in which alcoholics and alcohol- 
ism are depicted. The growing literature om »oststructural literary and cultural 
interpretation (e.g. Mulvey 1989) can illuminate sociological studies of cul- 
tural practice and meaning, helping reverse their sociologically reductionist 
tendencies by showing the interplay betweea expression and interpretation. In 
this sense, poststructuralist discourse theory could converge with non- 
Derridean approaches to discourse (e.g. euthnow 1987) in a fruitful way. 

9. In particular, a postmodern and postst-uctural feminism (Kristeva 1980, 
Irigaray 1985, Cixous 1986, Weedon 1937) suggests concrete empirical 
studies of the ways in which discourses like "ilm (Mulvey 1989) are structured 
by gendered themes. In particular, femimist cultural studies focus on the 
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different power positions of women and men as these influence writing and 
reading. With Lacan and Lyotard, these approaches to feminism reject the 
notion of a singular (male) vantage point from which knowledge and dis- 
course are developed. They attune us empirically to the ways in which 
knowledge of the world is structured by discourses (e.g. pornography) that 
reflect conflict over power; they decode these discourses as politically salient. 
Richardson (1988, 1990a,b,c) has systematically developed the sociological 
implications of poststructural and postmodern feminism with respect to the 
ways social scientists tell their research stories: Fraser (1984, 1989) has 
blended critical theory, feminist theory, and poststructuralism in her develop- 
ment of a theory of practice relevant to women as well as men. 

10. The "new social movements" theory of Habermas (1981b; also see 
Boggs 1986) offers theoretical insights to scholars of social movements who 
otherwise lack a larger theoretical perspective that explains where these 
movements come from and what sort of structural impact they might have. 
Finding a course between orthodox-Marxist theories of class struggle and 
non-Marxist perspectives on social movements, Habermas retains the Marxist 
vision of transformational sociopolitical action while significantly altering 
left-wing orthodoxy with respect to movements deemed irrelevant by tradi- 
tional Marxists, especially movements of people of color, women, anti- 
colonialists, antinuclearists, environmentalists, etc. Here, as in his (1975) 
analysis of the state's legitimation crisis, Habermas makes contact with 
venerable sociological concerns and places them in a larger historical- 
materialist framework, recouping their most radical insights in spite of them- 
selves (e.g. his reading of Parsons in The Theory of Communicative Action 
(1987b:199-299). Unlike most sociological students of social movements 
(and like Foucault in this respect), Habermas locates points of resistance 
against systemic domination that give his overall critical social theory a 
certain practical intent (see Kellner 1989b). 


WHAT COUNTS AS SOCIOLOGY? A 
CAUTIONARY NOTE 


For mainstream sociology to adopt, and thus adapt to, these three theoretical 
perspectives would substantially change the nature of the discipline. At some 
level, the notions of poststructural and postmodern sociology are oxymorons: 
Postmodernism and poststructuralism, like critical theory, resist their integra- 
tion into a highly differentiated, hierarchized, technical discipline that defines 
itself largely with reference to the original sociologies of Comte, Durkheim, 
and Weber, who established the positivist study of social facts and separated 
the vocations of science and politics. These three theoretical perspectives 
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question the rights of academic disciplines `o exist apart, especially within the 
frame of positivism. 

Critical theory, poststructuralism, and postmodernism challenge the ter- 
ritoriality of sociology, including its differentiation from other disciplines in 
the human sciences as well as its heavy »eliance on method with which to 
solve intellectual problems. All three perspectives oppose the mathematiza- 
tion of the world, even if they logically allow for mathematics as one 
discourse among many. This is not to pxivilege qualitative methodology. 
After all, qualitative methods can be as pos.tivist as the quantitative kind. The 
poststructural critique of language casts doubt on ethnographies which rely on 
subjects’ accounts of their own experienze as if these accounts, like the 
accounts of experts, are not already encoded with undecidable meaning. 

These three theoretical perspectives redefine the human sciences and cul- 
tural studies in ways that blur traditional disciplinary boundaries (Brodkey 
1987). They are all committed to interdizciplinarity (see Klein 1989), de- 
constructing disciplinary differentiation as arbitrary. Derrida's strategy of 
reading emphasizes the intertextuality of writings that attempt to seal them- 
selves off from the contaminating influences of other versions, other writers, 
other disciplines. He argues that all texts are inflected by other texts to the 
point of genuine interdisciplinarity. In other words, these three theoretical 
perspectives open up the question of wha* counts as sociology. 

Sociology has progressed far beyond (or regressed far behind, depending 
on one's perspective!) the sweeping specu ation that characterized sociology 
in the classical tradition. This classical tracition has been enshrined as Grand 
Theory (see Agger 1989b:181—86) in order to legitimate the subsequent 
technical discipline; witness the telling publication of an obscure Parsons 
paper (Parsons 1990) as a lead article in a recent ASR, as well as sympathetic 
commentaries on it by two functionalist luminaries (Alexander 1990, Cole- 
man 1990): The Parsons article is positioned in order to add canonical value to 
the technical articles following it, enhancing disciplinary territoriality and 
identity at a time when mainstream sociology risks becoming mathematics. 

The theoretical challenge to sociological territoriality posed by critical 
theory, poststructuralism, and postmoderrism is resented in some quarters. 
Rebutting Gottdiener (1990), Randall Collins (1990:462) dismisses Foucault 

-as a theoretical “amateur.” Collins does rot recognize that Foucault would 
have loved to be called an amateur. Foucault implies that the professional/ 
amateur distinction is a peculiar product of the discourse/practice of late 
capitalism, wherein unofficial knowledges are disqualified as unrigorous, 
undisciplined, unprofessional. Foucault's amateurism positions itself outside 
of disciplinary mainstreams so that he can gain a useful vantage on them. 
Similarly, Habermas’ encyclopedic grasp of a huge range of disciplinary 
literatures, from psychology and political theory to economics and sociology, 
threatens the narrow professionalism of disciplinary academics. 
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Critical theorists, poststructuralists, and postmodernists rethink the prevail- 
ing definition of what counts as sociology; they would enlarge that definition 
considerably. In so doing, they risk losing productive disciplinary identity and 
a workable professional division of labor, but they stand to gain an enriched 
perspective on the literary and substantive practices of sociology. Whether or 
not mainstream sociology will countenance theoretical and political interroga- 
tion of the kinds provided by the three theories discussed here has yet to be 
determined. | 
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Abstract 


The concept of vacancy chains, originally developed in Harrison White's 
pioneering analysis of organizational mobility processes, has been extended 
to phenomena as diverse as national labor and housing markets, the historical 
development of professions, gender and ethnic group discrimination in job 
and housing markets, organizational demography, and the mobility of hermit 
crabs to empty snail shells. In all populations in which they occur—whether 
human or animal—vacancy chains appear to organize a variety of social 
processes in nearly identical ways. This chapter provides a broad and relative- 
ly nonmathematical review of the vacancy chain literature covering basic 
. definitions and formulations, main theoretical ideas and assumptions, com- 
parisons of social processes in different vacancy chain systems, and several 
conceptual and methodological extensions to vacancy chain analysis. The 
review concludes by discussing a number of outstanding problems, present 
limitations, and promising areas for future research using the vacancy chain 
approach to mobility. 


INTRODUCTION 


Sociologists as well as economists and biologists often consider individuals 
who gain material resources or social positions as theoretically independent, 
virtually isolated entities. In this view what is done by one individual does not 
affect other individuals except in a simple negative way: If one individual gets 
a resource unit, other individuals do not; mobility is a zero-sum game. But in 
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those cases where material resources or social positions are distributed 
through vacancy chain processes, things work differently. Here the mobility 
of several individuals is linked together: What happens to one individual at 
one point in time and space redounds to affect other individuals at other points 
in time and space. Because of this fact, studies of mobility through vacancy 
chains rely upon different theoretical conceptions and methodological tech- 
niques than do studies considering individuals as independent units of analy- 
sis. Application of the more standard theorerical concepts and methodological 
techniques to systems in which individuals actually move through vacancy 
chains can, consequently, produce misleadimg analyses of causes and patterns 
of mobility. 

This chapter provides a broad and relatively nonmathematical review of the 
vacancy chain literature. It begins with sasic definitions, moves to the 
common properties of all vacancy chain systems, continues with a variety of 
extensions based upon the vacancy chain concept, covers some outstanding 
problems in data collection and methodology, and concludes by contrasting 
the theoretical underpinnings of the vacancy chain approach to mobility with 
those of more standard approaches in socielogy. This chapter suggests that 
resource distribution through vacancy cha ns organizes a variety of social 
processes in similar ways in any population in which it occurs—whether 
human or animal. So individuals who ostersibly have nothing in common— 
such as Methodist ministers moving to new and better pastorates, state police 
officers in Michigan gaining new jobs, biyers and renters acquiring new 
dwellings, people buying new and used cars, and hermit crabs occupying 
empty snail shells—are all alike in that they gain new physical resources or 
social positions through vacancy chain mobility processes (citations for these 
and other points will be given in the sections following this Introduction). 
Underneath all the apparent diversity of »ehavior, culture, and cognitive 
ability in these different systems, vacancy chains produce similar patterns of 
short-term mobility, generate about the seme average number of mobility 
opportunities per chain, affect career and demographic distributions in the 
relevant systems in about the same way, and influence analogous patterns of 
aggregate benefits for mobile individuals as well as for other individuals 
connected to those moving. 

In addition to these structural similarities in all the vacancy chain systems 
investigated, researchers have applied vacancy chain analysis to the study of 
other central problems in sociology including the operation of national, 
regional, and organizational labor markets; national and regional housing 
markets; gender and ethnic group discrimination in job and housing markets; 
the tracing of elite groups within professiors; and the historical development 
of professional groups. Building upon the recent demonstration of the use of 
vacancy chains in animals, researchers have shown the importance of vacancy 
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chain theory for a more adequate understanding of the behavior, demography, 
and ecology of species using this type of resource acquisition process. 


BASIC DEFINITIONS AND FORMULATIONS 
What Is a Vacancy Chain? 


In a vacancy chain, a new resource unit coming into a population is taken by a 
first individual who leaves his/her old unit behind, this old unit is taken by a 
second individual leaving his/her old unit behind, and so forth. For example, 
if the president of a company retires, her job may be taken by a vice- 
president, his job by the head of sales, and so on. Eventually a chain has to 
end, usually when a new recruit enters the system—someone moving into the 
job or housing market for the first time—or the last unit in a chain is 
abandoned, merged, or destroyed—the last house in a chain is torn down, left 
empty, or the duties of a vacant job are distributed among other employees. 
Vacancy chains are started when an initial vacancy comes into a population— 
a new house is built, a new car manufactured, or a new job created—or an 
existing unit is vacated by someone leaving the system under consideration— 
an employee retires, a hermit crab dies leaving the snail shell it occupied 
intact, or a home owner goes to a nursing home. 

A vacancy chain is simply the sequence of moves that a vacancy makes 
from initial entry into a system to final termination. Figure 1 shows a vacancy 
chain in the Methodist church taken from White’s (1970: 68) study of 
mobility in three American churches. This chain starts with a retirement and 
ends with the recruitment of a newly ordained minister. The vacancy in this 
chain moves five times, starting with the initial move from the church on the 
extreme left and ending with the final move out of the mobility system. This 
chain provides mobility opportunities for five ministers—not counting the 
move to retirement for the minister creating the chain. 


Conditions Required for Vacancy Chain Mobility 


The distribution of material resources and social positions through vacancy 
chain processes requires that the things distributed possess a number of 
abstract qualities. These qualities include that the resource units in question 
are reusable, discrete, identifiable, and utilized by one individual or social 
group at a time. Further, in vacancy chain mobility systems, a unit must be 
vacant before it can be taken by a new occupant; individuals must need or 
want new and usually "bigger" or "better" units from time to time; vacant 
units must be scarce (the number is small compared to the number of 
individuals who want them); and most individuals in a group already must 


136 CHASE 


e Ñ FY Vacancy 






Retirement, 
Death 








wba 


Figure } A sample vacancy chain from the Methodist church, from White (1970: 68). 


have units so they can leave one behind when they move to a new one (the 
number of recruits is relatively small compared to the number of individuals 
already in the system). Combinations of sccial conventions, economic con- 
siderations, or the physical nature of the resource units can produce the 
general conditions required for distribution -hrough vacancy chain processes. 
White (1970), Sørensen (1983), and Sgrersen & Tuma (1981) discuss the 
nature of jobs in bureaucracies that allow them to be distributed through 
vacancy chains. Sørensen (1983) grounds h.s work on the earlier discussions 
of Boudon (1974), Thurow (1975) and Hirsch (1976). Later in this paper I 
suggest a variety of physical resources and social positions that possess the 
requisite qualities for distribution through vacancy chains par which have not 
yet been investigated. 


Collecting Vacancy Chain Data 


In order to trace a vacancy chain, a researcker notes a vacancy coming into a 
system and then follows this vacancy as it moves to a number of resource 
units in turn and then finally leaves the system. Because vacancy chains in 
humans often take considerable periods of time, on the order of months, to 
move from beginning to end and because the chains often involve people 
widely scattered geographically, researchers usually trace the chains by 
reconstructing them from organizational records or from series of linked 
interviews. For example, White (1970) used yearbooks listing ministers and 
their churches to trace chains, and Lansing et al (1969) followed chains across 
the United States by interviewing families moving into new houses, the 
families replacing them at their old addresses, and so forth. In contrast, 
vacancy chains in snail shells utilized by hermit crabs move from beginning to 
end in about an hour within a restricted physical area, and researchers (Chase 
& DeWitt 1988, Chase et al 1988) directly observed these chains and were 
able to record some on videotape. Hermit crabs do not have protective 
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exoskeletons completely covering their bodies as regular crabs do, so they 
must live in and continually carry around empty snail shells as portable 
shelters. The observers traced vacancy chains in this species by introducing an 
empty snail shell into a tidal pool and recording the sequences of moves 
initiated as a first crab took the new shell and left its old one behind. 


Transition Matrix 


A transition matrix summarizes the raw data from a set of vacancy chains; in 
the form of a transition frequency matrix it shows all the movements that a set 
of vacancies make, including the terminal moves—those outside the system 
that end chains (when new recruits take final units or final units are abolished, 
abandoned, or destroyed). Table 1 gives a matrix showing the frequencies of 
moves made by a set of 92 vacancy chains for hermit crabs moving to empty 
snail shells (Weissburg et al 1991). Taking the first row of the matrix, for 
example, vacancies moved two times from old shells of size 1 to new shells 
also of size 1, seven times from size 1 to size 2 shells, etc, while one vacant 
size 1 shell was abandoned and left unclaimed. In contrast to other kinds of 
mobility studies, this matrix shows the moves of vacancies rather than 
individuals, because in vacancy chain systems vacancies are causal: they must 
exist first, before individuals can move. But the movements of individuals can 
be inferred from the movements of vacancies, for example, in cell (1,3) of the 
matrix, vacancies moved nine times from size 1 to size 3 shells, and the same 
number of individuals moved in the opposite direction, from old size 3 shells 
to new size 1 shells. 

If the cell entries in a matrix showing frequencies are divided by their 
respective row totals, a new matrix showing the maximum likelihood es- 
timates of the probabilities of a vacancy moving between various strata of 
resource units results. These matrices are referred to as transition probability 


Table 1 Transition frequency matrix for vacancy moves in snail shells by 
hermit crabs from Weissburg et al (1991)? ` 








Destination state 
Transient states Absorption states 
2-3 4 5 Naked? Abandoned? Total 








= 


Origin state 





d 


1 2. 24 9 2 0 0 H 21 
2 '0 3 19 17 1 0 42 
3 0 2 20 1] 10 4 23 70 
4 0 0 10 26 26 6 24 92 
5 0 0 0 5. 322 2 30 59 


ê Reproduced with permission from Evolutionary Ecology 

> Termination by crabs without shells taking the last shell in a chain 
€ Termination by the last shell in a chain not being taken 

“State 1 shells are the largest and state 5 shells are the smallest. 
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Table2 Transition probability matrix for vacancy moves in snail shells taken 
by hermit crabs from Weissburg et al (1991)* 


Destination state 
Origin state 1 2 3 4 5 naked? ^ abandcned* 


14 .40  .33 43  .10  .00 .00 .05 
2 00 07 .45 40 .02 .00 .05 
3 00 .03 .29 .16  .14 .06 .33 
4 .00 .00 .11 .28 28 .07 .26 
5 .00 .00 .00 .08 .37 .03 i 


*see footnote a in Table 1 
>see footnote b in Table 1 
*see footnote c in Table 1 
4see footnote d in Table 1* 


matrices in the Markov formulation, to be explained below, and substantively 
they summarize the typical short-term mobility movements of individuals in 
vacancy chain systems. Table 2 shows Teble 1 converted into a transition 
probability matrix. 


Markov Models of Vacancy Chain Data 


Most researchers working in the vacancy caain tradition use simple Markov 
models as tools to describe further their raw vacancy chain data and to make a 
number of predictions that can be compared with observed data. These 
models are usually embedded, first-order Markov chains with absorbing 
states. Markov chains are used because these models assume that a process 
moves from state to state, as a vacancy moves from one resource unit to 
another, and that a distinct probability can be assigned to each possible 
transition. “First-order” means that the researcher assumes that the next 
position to which a vacancy moves is determined solely by the position that it 
is currently in; an individual thinking about moving evaluates only the vacant 
position; he/she does not care about, and probably does not even know about, 
the positions that the chain was in before the current one. "Embedded" refers 
to the fact that the model does not consider how long it takes a chain to move 
in terms of clock time; the model treats th» sequence of moves that a chain 
makes but not how long is required for it to do so. The absorbing states are in 
the model because all chains eventually end; they do not go on indefinitely. A 
chain is absorbed when it terminates beceuse the last vacant unit is aban- 
doned, destroyed, or merged with an existing unit, or when the last unit is 
taken by an individual who does not leave an opening behind, e.g. new 
recruit, an adult child departing his/her parents’ home for a first apartment, or 
a divorcing husband leaving his wife and children behind in the family house. 

In the model vacancies move between nits (positions) characterized by 
states (strata). States, as in other sociolog:cal studies of mobility, are often 
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determined by simple attributes of positions relevant to status within the 
particular system of concern. For example, White (1970) used income and 
membership size to divide churches into states; Sands & Bower (1976) used 
sale or rental price for housing units; and Weissburg et al (1991) used physical 
size for the snail shells occupied by hermit crabs. Some researchers have 
combined different qualities to determine state boundaries: Marullo (1985) 
studied housing using both sales price and race of owner, and Weissburg et al 
(1991) used both size and quality for snail shells. In studies of national and 
regional labor markets, Harrison (1988, 1990) and Stewman (1985) allocated 
states by type of occupational grouping. 

In deciding upon the number of states, researchers must balance two 
opposing trends. First, the mathematical model assumes that any two vacan- 
cies in the same state are identical in terms of their probabilities of movement 
to the other states. In general dividing the positions into many states makes for 
more homogeneous categories, but this is balanced by a second consideration. 
Using many states requires many observations in order to estimate transition 
probabilities accurately, and having many observations requires assembling 
large and often expensive and time-consuming data sets. In practice most 
researchers have used between three and seven states except in large-scale 
studies of national or regional labor markets where more have been used 
(Stewman 1985, Harrison 1988, 1990). 

Decisions about where to draw boundaries between statuses, as in other 
kinds of mobility studies, are often somewhat arbitrary in vacancy chain 
studies. Often the attributes used to rank the states are continuous, like salary 
or shell size, and researchers utilize a combination of exploratory data analy- 
sis and their own intuitive knowledge of the particular system to mark 
boundaries (see White 1970 and Weissburg et al 1991 for discussions of how 
to draw boundaries among states). Developing rigorous statistical techniques 
to form state categories is a difficult statistical problem, but a technique 
developed by Rosenzweig (1986) in another context may be adaptable for use 
in separating states (Weissburg et al 1991). 


COMMON PROPERTIES OF VACANCY CHAIN 
SYSTEMS 


Short-Term Mobility Movements 


The general patterns of short-term mobility are virtually identical in all the 
vacancy chain studies reviewed for this paper. Tables 3 and 4 give additional 
transition probability matrices for studies of job (White 1970) and housing 
mobility (Marullo 1985), to go with the matrix for hermit crabs (Table 2). In 
these and other matrices reviewed, vacancies usually move to other positions 
within the same state or they move downward one or sometimes two states in 
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Table 3 Transition probability matrix for vacancy 
moves in the Methodist church 1922-1937 from White 
(1970: 125)? 


Destination state 
Origin state Big Medium Small Outside? 


Big 46 3 05 16 
Medium Ai Al 12 36 
Small 02 16 24 58 


*Reproduced with permission frcn Harvard University 
Press 

* Indicates probability of absorption >y all ways of terminat- 
ing chains combined 


status. Vacancies rarely move to higher ranEing states, and vacancies are most 
often absorbed from lower-ranking states. 

Looked at from the point of view of individuals rather than vacancies, 
individuals affected by chains starting in & given state usually are upwardly 
mobile in small to moderate jumps in status or Jaterally mobile within their 
same status; they rarely experience downward mobility. In addition, chains 
are much more likely to terminate from a iow-status than from a high-status 
position. Recruits are much more likely tc enter a mobility system near the 
bottom, and low-ranking positions are much more likely not to be taken by 
any individual (abandoned), to be destroyed, or to be merged with existing 
positions. To look at these matrices in vet another way, vacancy chains 
typically flow from higher-ranking or larger resource units to lawer-ranking 
or smaller ones as individuals trade upward to higher-status or more com- 
modious positions. 


Table 4 Transition probability mat-ix for vacancy moves in 
owned housing units, 1976—1977, from Marullo (1985 : 371)* 


Destination state 


Origin state High Medum Low Outside” 
High 45 B .09 .08 
($50,000 plus) 

Medium .08 39 35 .19 
($30,000— 49,999) 

Low .01 2 38 .52 
($0—29,999) 


^ Urban Affairs Quarterly, volume 20. issue number 3, page 371, 
copyright € 1985 by Sage Publications, mc. Reprinted by permission 
of Sage Publications, Inc. 

^ Indicates probability of absorption by z1l ways of terminating chains 
combined 
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Length of Vacancy Chains 


The length of a vacancy chain is the number of moves a vacancy makes after 
its entrance into a system, and this includes the terminal move to the absorb- 
ing state. Borrowing from input-output economics, White (1970) refers to the 
length of a vacancy chain as its multiplier effect: The impact of the appear- 
ance of a vacancy is multiplied as it makes several movements through a 
mobility system. For example, the multiplier effect for the chain in Figure 1 is 
5.0. 

In mobility systems in which researchers have traced vacancy chains from 
initiation to termination, the multiplier effects reported are remarkably similar 
in size. Usually average chains have a length around 3.0; chains begun with 
elite resource units produce chains of length from nearly 4.0 to nearly 5.0. 
Chains with lengths in these ranges have been reported for mobility systems 
as ostensibly diverse as those for people gaining jobs in bureaucracies (White 
1970, Stewman 1975a, 1985), hermit crabs acquiring snail shells (Weissburg 
et al 1991), and people buying houses (Lansing et al 1969, Marullo 1985) and 
cars (Smith 1941). A number of other researchers have reported chains with 
smaller multiplier effects, but in these studies the researchers did not trace 
chains from beginning to end but terminated their chains prematurely when 
they went beyond some arbitrary boundary such as a local housing market or a 
particular job market (Kristof 1965, Watson 1974, Sands & Bower 1976, 
Smith 1983, Smith & Abbott 1983). 

Researchers oftén determine how well a Markov model fits observed 
vacancy chain data by comparing observed and predicted average multiplier 
effects. The predicted distribution of expected vacancy chain lengths is given 
by what is referred to as the fundamental or multiplier matrix in the Markov 
formulation: 


N2(-Q' 1. 


where Q is an m X m submatrix of the transition matrix including only the 
probabilities of transitions among the various transient (nonabsorbing) states, 
and I is an m X m identity matrix. Each element nj of N equals the expected 
number of times a vacancy starting out in state i will be in resource units in 
state j before it is absorbed. Most standard texts discussing Markov chains 
with absorbing states give the derivation of this equation (e.g. Kemeny et al 
1966, Fararo 1973, Roberts 1976, Heyman & Sobel 1984). For matrices of up 
to moderate size, this calculation is easy to program and perform on any 
microcomputer. 

The row sums of N give the expected multiplier effects for chains starting 
in each of the possible transient (nonabsorbing) states. In matrix notation 


142 CHASE 
n = NI 2. 


where n is a column vector giving the predicted multiplier effects and lis a 
column vector of 1’s. In most vacancy chain studies which have compared 
actual empirical estimates of multiplier effects with predictions made by 
Markov models, fits have been very close (2.g. White 1970, 1971, Stewman 
1975a, Smith 1983, Smith & Abbott 1982, Weissburg et al 1991). 


The Number of Individuals Experiencing Mobility in a Chain 


The number of individuals experiencing mooility in a vacancy chain is closely 
related to a chain’s multiplier effect, and th» exact relationship depends upon 
the way in which a chain terminates. If a clrain terminates (is absorbed) when 
an individual without a resource unit, e.g. a new recruit, enters the mobility 
system, the multiplier effect of the chain 2quals the number of individuals 
experiencing mobility in the chain. This is the case in Figure 1 where the 
multiplier effect is 5.0 and the number of mobile ministers is also 5.0. If 
instead a chain terminates because the last position in the chain is abandoned, 
destroyed, or merged with another position, the vacancy is considered to 
make the last move to an absorption state svithout a corresponding mobility 
event by an individual. In this case the number of mobile individuals is one 
less than the multiplier effect. 

Most vacancy chain studies do not specifically report the average number 
of mobile individuals for chains starting in various states (but see Weissburg 
et al 1991). However, the fact that researchers report similar multiplier effects 
in mobility systems involving jobs (White 1370, Stewman 19752), snail shells 
(Weissburg et al 1991), houses (Lansing et al 1969, Marullo 1985), and cars 
(Smith 1941) implies that remarkably simalar numbers of individuals have 
mobility in typical vacancy chains in all these systems because, as indicated 
above, the multiplier effect and the numbe- of mobile individuals in a chain 
are equal or differ at most by unity. 

It is quite easy to calculate the expected number of mobile individuals in 
chains starting in various strata using the scandard Markov model. First, the 
probabilities of a chain terminating in abscrption states which do and which 
do not provide one final mobility opportun&y for an individual are calculated 
as follows: 


B = NR 3. 


where R is an m X 2 submatrix of the origimal transition matrix, with the first 
column giving the combined probability of absorption in those states provid- 
ing one final mobility opportunity for individuals and the second column 
giving the combined probability of absorption for those states not doing so. 
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The matrix B then shows the probabilities of a chain terminating in either kind 
of absorption state given the initial state of the chain. 

The average number of individuals experiencing mobility for chains started 
in the various states is given as: 


Avgl;-(biXxn)-ti1—-b)x(n-D]-bi-tn-ili 4. 


where b; is the probability that a chain starting in state i is absorbed in a state 
generating one final mobility event, 1 — 5j; is the corresponding probability 
of termination in any state not providing a last mobility opportunity, and n; is 
the expected multiplier effect for a chain starting in state i. 


Detailed Predictions of Individual Mobility 


A common set of techniques provides detailed predictions of individual 
mobility in any vacancy chain system (see Stewman 1986a for the derivation 
of these methods). Using these techniques, a researcher can predict the 
number of individuals moving from positions in one state to those in any other 
state, including absorption states, given a distribution of vacancy chains 
starting in the various states of a system. For example, a researcher can 
answer questions of this sort: What specific mobility movements would result 
if 12 new chains were begun in the highest stratum of units, 43 chains in units 
of the stratum just below, etc. 

One of the most useful techniques for inane detailed mobility predictions 
starts with the creation of a simple from-to matrix which shows the expected 
number of times a vacancy will move from one particular state to any other 
particular state, given the state in which the chain started. Each element of 
such a matrix can be calculated as mik) = ny X tj, where nj is an element 
from the fundamental matrix and fj is one from the full transition matrix 
(including absorbing states). Thus, the element mj, is the product of the 
number of times a vacancy starting in state i will be in state j before absorption 
and the probability that a vacancy in j will move to state k. In matrix form 


M; = D, T, 5. 


where Dj is a diagonal matrix of elements from the ith row of the fundamental 
matrix. 

The from-to matrix M; shows all the movements of one vacancy starting in 
state i as it moves through and finally out of the mobility system. If several 
chains start in state i, then M; must be scalar multiplied by the number of 
chains in order to predict all the mobility movements deriving from these 
chains. The resulting matrix shows the detailed from-to movements for 
vacancies resulting from these chains, and the transpose of this matrix gives 
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the detailed predictions of mobility movements for individuals. If the basic 
from-to matrix (Mj) for each possible starting state for a vacancy chain is 
scalar multiplied by the number of chains starting in each state, and if the 
resulting matrices are added together and then transposed, a grand matrix of 
detailed from-to mobility predictions for irdividuals results. Although these 
calculations are easy to carry out, comparisons between observed and pre- 
dicted from-to mobility patterns have rarely been carried out in published 
studies of vacancy chain mobility. Exceptions are found in Stewman (1985) 
and Weissburg et al (1991), both of which show good fits between predicted 
and observed mobility movements in job end hermit crab mobility systems, 
respectively. 

Because various attributes of individuals are correlated with their positions 
in a mobility system, from-to matrices could also be used to make detailed 
predictions about the number and characteristics of mobile individuals. For 
example, in human vacancy chain systems, income level and age are often 
correlated with position, and in hermit c-ab systems body size is closely 
correlated with shell size (Chase et al 1988). Using this information, a 
researcher could predict not only how many individuals would be mobile from 
specific states to specific states but also what are the individual attributes of 
those individuals making the transitions. Eesearchers have not yet reported 
such predictions, but vacancy chain theory can easily be advanced into this 
area. 

Other detailed mobility predictions car be derived from the matrix of 
from-to predictions or can be calculated imlependently. For example, Stew- 
man (1986a, 1985) derives and predicts the number of promotions, transfers 
(lateral moves), demotions, and new recruits hired as a result of a set of chains 
begun in the various states of a model. 


Aggregate Benefits and Deficits Resulting from the Initiation 
of Vacancy Chains 


In all vacancy chain systems the introduction of an initial set of vacant 
resource units has predictable, aggregate .mpacts for those individuals ex- 
periencing mobility. These impacts can be benefits in some systems—for 
example, humans getting new jobs often mcrease their salaries, and hermit 
crabs acquiring new shells grow larger and produce more offspring. In other 
systems mobile individuals may suffer deficits, as in housing chains where 
individuals moving often pay greater housing costs (Lansing et al 1969) than 
they did previously (they also may benefit by having more space, being nearer 
their places of work, having better school systems for their children, etc). 

Although no researcher has yet applied taem, the same set of methods can 
be used to predict aggregate benefits or deficits in different kinds of vacancy 
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chain systems. Given detailed predictions of mobility resulting from an initial 
set of vacant units, such as provided by the from-to matrices discussed above, 
and the average increase in benefits or deficits, a researcher could easily 
predict the aggregate impact for a population. To provide an illustration, 
assume that a researcher has calculated a grand from-to matrix for all the 
predicted individual moves in an organization as a result of the retirements, 
deaths, and new jobs expected to be created in the next year. Each entry in the 
matrix is then multiplied by the respective increase or decrease in salary 
associated with such a move. Adding up all these products would give the net 
aggregate income increase for the individuals in the organization. Organiza- 
tional managers could use the same kinds of calculations to assay the impact 
of labor cuts, changes in the retirement age, and creation of new jobs upon 
labor costs and benefits to be paid (see Stewman & Konda 1981, Konda et al 
1981, Stewman 1988). 


Vacancy Chains and Associated Individuals and Institutions 


Vacancy chains have patterned effects upon other individuals and, in the 
human case, institutions connected to the primary individuals actually getting 
resources in the chains. For humans in chains for cars and houses, consider 
individuals such as real estate agents and car sales personnel and institutions 
such as banks and state and local governments. The individuals profit from 
sales commissions and agents' fees and the institutions from taxes, interest, 
and other payments when vacancy chains move through these mobility sys- 
tems. Longer chains and more costly resource units generate greater benefits, 
while shorter chains and less costly units generate smaller benefits, for the 
individuals and institutions involved. In hermit crab vacancy chain systems, 
the individuals connected to the crabs are members of the epibiont community 
of animals and plants such as algae, gastropods, sea anemonies, and hydracti- 
nia which compete to live on and in the snail shells occupied by the crabs 
(Chase & DeWitt 1988). Vacancy chains influence the fates of epibionts: 
those on larger shells which are more likely to be passed on in vacancy chains 
live longer and reproduce more than those on smaller shells likely to be 
abandoned at the ends of chains. 

At present researchers have not tried to predict the aggregate benefits 
received by associated individuals and institutions, but this is yet another area 
into which vacancy chain theory can be extended. The required techniques are 
analogous to those described above for estimating aggregate benefits for the 
primary individuals actually moving in chains. The level of benefits (or 
deficits) for associated individuals or institutions related to specific vacancy 
movements would need to be multiplied by the predicted number of such 
movements (from a from-to matrix), and the resulting products added 


146 CHASE 


together. This procedure could predict, for example, the aggregated total sales 
taxes and registration fees paid to a state government, or the aggregate income 
to real estate agents as a result of the chains started by the sales of a specified 
set of new cars or houses, respectively. 


Vacancy Chains and Career Processes 


In vacancy chain systems the chains themselves, in conjunction with a few 
other features of the systems, largely determine the career chances of the 
respective human and animal populations. Consider the set of resource units 
or social positions occupied by incumbents Df a vacancy chain system at some 
specific point in time. In most systems this set probably looks like a typical 
organizational pyramid with relatively fev units or positions of the largest 
size or highest status, more of intermediate sizes or statuses, and more yet of 
the smallest size or lowest status. As vacant units new to the system enter— 
houses just built, jobs just created, snail shells just emptied—and as the 
occupants of units already iri the system pexmanently leave—they die, retire, 
move into institutional housing— vacancy caains are created. These vacancies 
trickle down to create a variety of mobility opportunities for individuals 
within the system. The number and initial scrata of the vacancy chains created 
influence the career possibilities of the ind viduals already in the system and 
those about to enter. The more chains in tiated at higher levels, the more 
mobility opportunities for individuals thrcughout the system, as the chains 
move downward, and the greater the speed-with which individuals advance to 
high status or large units and positions. On the other hand, if chains are 
mostly initiated at lower levels, individua s already in high status positions 
will see their careers plateau while only those at lower levels will move 
comparatively quickly before they reach a career bottleneck caused by the 
lack of opportunities at higher levels. 

Building upon earlier work by White (1970), Keyfitz (1973), and Stewman 
& Konda (1983), Stewman (19862) presencs the mathematical framework for 
calculating the probabilities for advancement (and lateral moves and de- 
motions) from stratum to stratum within vacancy chain mobility systems. This 
framework assumes the input of events starting chains as an exogenous 
variable. Stewman (19862, 1985) uses his techniques to draw “Venturi tubes” 
showing empirical estimates of the probability of advancement in several 
organizations. Three sample Venturi tubes are shown in Figure 2 for systems 
with four strata. These Venturi tubes grapaically display the probabilities of 
mobility between various strata: The hourglass shape on the left indicates a 
system with a restricted probability of uoward mobility for individuals at 
intermediate status levels; the diamond shzpe shows restriction at the highest 
and lowest levels but relatively easy movement at the intermediate level; and 
the pyramid decreasing mobility chances as an individual moves upward. 
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Probability of upward mobility: 
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Figure 2 "Three possible shapes for Venturi tubes in a mobility system with four strata. Stratum | 
lis the highest ranking stratum and stratum 4 the lowest ranking. The widths between the sides of 
the tubes at the various gateways between strata, marked by the dashed lines, indicate the relative 
probabilities of upward mobility for an individual moving from one stratum to another. See the 
text for further details. 





In applying his techniques Stewman (1986a) discovered a particularly 
striking and unexpected finding. Although a large number of Venturi tube 
shapes are theoretically possible, in real organizations he found only a few 
actual shapes: different organizations showed similar mobility patterns. Stew- 
man found that the probability of mobility did not decrease the higher an 
individual rose in an organization, as would seem to be dictated by the shape 
of the organizational pyramid. Instead, individuals reaching intermediate 
levels often had greater probability of advancement than individuals at lower 
levels (see S. Spilerman & T. Petersen, submitted for publication, for a study 
of upward mobility chances by gender in a large corporation using different 
methodological techniques). 

Other researchers have also explored career movements and attainments in 
vacancy chains systems. Sørensen (1977) proposes a model of attainment in 
Systems with an exponential distribution of positions in which individuals 
move upward one stratum at a time through vacancy chains as other in- 
dividuals leave the system. In contrast with Stewman's empirical in- 
vestigations. of several formal organizations, S¢rensen’s model suggests rapid 
growth in attainment at the beginning of careers, slower growth later, and 
finally a stable plateau: Abbott (1990) uses vacancy chain theory for historical 
investigations of the careers of managers in psychiatric institutions. He 
estimates the probability of advancement and the time required for advance- 
ment in order to track historical trends and isolate elites within the larger 
group of managers. Stewman & Yeh (1989) have recently developed a 
technique that traces out the various occupational positions individuals are 
predicted to move through during their careers in an organization. 


Vacancy Chains and Organizational Demography 


In systems in which they operate, vacancy chains have a strong influence 
upon organizational demography: Such factors as length-of service distribu- 
tions for individuals within the organization, rates of advancement within the 
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system, the age distribution of individua within particular strata, cohort 
effects for mobility within the system, and waiting times for advancement 
(Stewman 1981). Stewman’s (1988) recert review in this same series pro- 
vides a detailed discussion of the interrelationship of organizational demogra- 
phy and vacancy chains examining both empirical results and theoretical 
developments in four areas: intraorganizational demography, interorganiza- 
tional demography, individual careers, ani linkages between organizations 
and external populations. Although the techniques of organizational demogra- 
phy have been developed for the analyzis of human populations, these 
methods promise more adequate understancing of the demography, behavior, 
and ecology of animals getting resources through vacancy chains than do 
traditional biological formulations (Weissburg et al 1991). 


ELABORATIONS AND EXTENS-ONS OF VACANCY 
CHAIN THEORY 


Researchers have extended and elaborated the underlying theory of vacancy 
chains to explore a variety of important issues in sociology. Harrison (1988, 
1990) has expanded the basic conception of vacancy chains to reconceptualize 
our understanding of occupational mobilty in the national labor market. 
Harrison does this by going from the idez of a vacancy chain to that of an 
opportunity chain. His data do not allow him to trace specific vacancy chains, 
but they do allow him to tabulate the occupational type of a worker's present 
job by that of his previous one. From this information he forms a transition 
matrix showing the probabilities that a job ~acancy in one type of occupation- 
al grouping will move to a job in the same or another occupational grouping. 
A vacancy in one grouping “calls” a worker from another grouping, which 
calls yet another worker, and so on to c-eate a chain of opportunities for 
advancement. Harrison's (1988) findings corroborate those of other research- 
ers (Stewman 1986b, Sommers & Eck 1977, Rosenfeld 1979) by indicating 
that for the national labor market often about two workers experience occupa- 
tional mobility for every vacancy occurring, and that there is some indication 
that a multiplier effect of 2.0 may be a lower bound on the average number of 
workers experiencing mobility (Stewman 1988). 

Harrison (1990) uses opportunity models to investigate "structural" mobil- 
ity in the national labor force, exploring the impact of economic development, 
the change in demand for various kinds of occupational groupings, on worker 
mobility. The great contribution of this work is that it moves nearer to the 
actual mechanisms behind mobility than many previous studies of occupation- 
al mobility in the national labor market. He builds upon this contribution by 
analyzing the recent development of the urdan underclass and develops a new 
analysis suggesting, in contrast with the prevailing wisdom, that the rise of 
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the underclass is not rooted in the simple absence of mobility opportunities 
but in the nature of the opportunities themselves (low-paying with short or 
unstable tenures) and the lack of historically occurring mobility pathways out 
of low-status occupational groupings (Harrison 1990). 

Stewman’s (1985) analysis of mobility within the civilian work force of the 
federal government takes a similar path. He does not trace individual vacancy 
chains but puts together data indicating the probabilities of a vacancy in one 
occupational grouping calling a replacement from another grouping. Using 
this technique he finds that none of the occupational groupings in the gov- 
ernment’s civilian work force are independent: Job events in one grouping 
reverberate to affect other groupings, and all are interconnected by the flow of 
mobility opportunities. 

Several researchers have used vacancy chain studies to investigate equal 
opportunity of access by different groups to mobility opportunities. Lansing et 
al (1969) and Marullo (1985) have investigated the access of income and 
racial groups to available housing units, and both conclude that African 
Americans are relatively disadvantaged in housing chains. Abbott & Smith 
(1984) have examined vacancy transitions to determine if the probability of 
filling positions is the same across gender. Abbott (1990) discusses the use of 
vacancy chain analysis to track elite groups within professions, reasoning that 
vacancies in elite positions should produce longer chains, and he uses these 
techniques in charting the historical development of the supervisory profes- 
sion for mental health hospitals in the eastern United States. 

Vacancy chain theory has also been used to explore issues at the border of 
sociology and economics. Smith (1983) uses vacancy chain analysis to in- 
vestigate market segmentation for college athletic coaches and shows that 
there are two markets—one core and the other peripheral—that spread across 
firms (colleges) rather than being internal to separate firms. Akerlof et al 
(1988) incorporate the concept of vacancy chains into a series of micro- 
economic models explaining cycles of labor turnover. 


OUTSTANDING PROBLEMS IN VACANCY CHAIN 
ANALYSIS 


One of the main obstacles to vacancy chain analyses of mobility is the scarcity 
of data sets. As mentioned above, researchers following vacancy chains in 
humans usually reconstruct chains from labor- and time-consuming reviews 
of organizational records. Some of these records are public, such as year- 
books for ministers (White 1970) and athletic coaches (Smith & Abbott 
1983), but others, particularly for private organizations, are not, and access is 
a problem. Some researchers (e.g. Harrison 1988, 1990, Stewman 1985, 
Marullo 1985) have avoided the problems of tracing chains from beginning to 
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end by gathering data just on transitions—comparing a person's present and 
former positions—from large, previously collected data sets. Inexpensive 
optical scanners available for microcomptters could help those researchers 
assembling complete chains from organizational records. These scanners 
could directly read records and transfer their contents to computer files for 
tracing of chains (Abbott 1990 mentions the use of computer routines for 
assembling chains from lists of positions 2ntered and left). 

Data on car registrations are already on computers for probably all states, 
and in New York at least, there is public access via computers to these data. 
Similar data sets appear to be available for private airplane and pleasure boats, 
which are also probably distributed via vacency chains. These data sets would 
make it possible to trace chains in major consumer goods without the exten- 
sive interviews used previously (e.g. Lanzing et al 1969). 

Studying vacancy chains in animals presents fewer problems of access, but 
the trade-off is that it is often difficult to permanently identify either in- 
dividuals or their resource units. The studies of vacancy chains in hermit crabs 
(Chase & DeWitt 1988, Chase et al 1988, Weissburg et al 1991) solved this 
problem by choosing an animal in which ckains progressed quickly. Recently 
developed, insertable and computer readable microchips promise help in 
permanently identifying individuals and resources (BioMedic Data Systems, 
Maywood, N.J.). 

A second outstanding problem is the provision of detailed predictions about 
career dynamics such as the length of service distributions for individuals at 
various strata, the distribution of waiting times for advancement for in- 
dividuals at various strata, and the distribution of times at each strata for 
individuals completing their careers. Finding general analytic solutions for 
these problems of career dynamics appears difficult, but some more elabo- 
rated versions of queuing theory, such as network queuing models, look 
helpful (M. Sobel, personal communicatior). Such models could also provide 
information on the analogous but untouched problem of the "career" dynam- 
ics of resource units and social positions: the distributions of occupation times 
and life spans of jobs, cars, houses, shel5, etc. 

A third problem is the identification of state boundaries for Markov mod- 
els. How should a researcher group various levels of resource units or social 
positions so as to provide homogeneous strata? As mentioned above most 
researchers have used a combination of exploratory data analysis and their 
knowledge of a particular system in selectirg boundaries. Harrison (1989) has 
recently developed an innovative new approach to this problem using 
blockmodeling techniques borrowed from the studies of networks, and the 
adaptation of Rosenzweig's (1986) methocs to forming boundaries was men- 
tioned above. 
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THEORETICAL ASSUMPTIONS AND IMPLICATIONS OF 
VACANCY CHAIN ANALYSIS 


We now have empirical demonstration that vacancy chains are responsible for 
the distribution of several resources important for humans: jobs, houses, and 
cars. Although they have not yet been investigated, many other resource 
systems appear to have the qualities necessary for distribution through va- 
cancy chains: major consumer goods such as pleasure boats and private 
airplanes and major pieces of industrial equipment such as pile drivers and 
state-of-the-art medical imaging devices. Further, Sørensen (1983) suggests 
that vacancy mechanisms apply to mobility in educational systems for tracks 
and other educational grouping, and Stewman (1988) argues vacancy for- 
mulations can apply to flows involving prison populations, voluntary groups, 
church memberships, nursing homes, and school districts. We also know that 
at least one type of nonhuman animal, hermit crabs, gets resources through 
vacancy chains. Avariety of other animal species utilize resources with the-- 
requisite qualities for vacancy chain allocation including some that are exotic 
by usual sociological standards: snails living in barnacle shells and rock 
crevices, snapping shrimp living in sea anemones, mantis shrimp occupying 
burrows, octopuses inhabiting gastropod shells, and blennies living in holes 
on coral reefs (see Chase & DeWitt 1988 for a review). 

The analysis of mobility in all these systems begins with a deceptively 
simple assumption: movement is contingent upon the presence of vacant 
positions. The actual, concrete mechanism of mobility in vacancy chain 
systems is the filling of vacant positions, and the filling of one vacant position 
means that another is opened and must be filled in turn. Vacancy chains are 
linkages of causes and effects in the mobility of individuals, and a good 
theory of mobility in those systems must incorporate this most basic of 
understandings. As Harrison White (1970: 328) indicates: "The core idea 
behind vacancy models is the need to trace social processes at a microscopic 
level of social structure to obtain valid causal theory." Similarly, Sgrensen 
(1977, 1983) and Sgrensen & Tuma (1981) argue in the strongest terms that in 
order to understand the processes of inequality and attainment, we need to 
develop a more adequate “sociological conception" of mobility, one based 
upon the actual vacancy mechanisms by which positions are allocated in many 
systems. They suggest that, when we examine mobility systems with "closed 
positions", i.e. systems in which vacancy mechanisms operate, we need to 
reformulate fundamentally our standard models in which isolated, in- 
dependent individuals attain positions contingent upon their personal charac- 
teristics. They see the more usual sociological models of attainment and 
economists' theories of human capital as applying in systems with "open 
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positions", i.e. those with market competit. on for positions where individuals 
are in fact selected contingent upon their personal attributes in comparison 
with all other eligibles (also see Abbott 1920 for a discussion of independent 
individuals versus interdependent ones in contingent mobility processes). 

If in many systems individuals do not get resources as independent and 
isolated entities, but their mobility opportunities are contingent upon the 
movements of other individuals and linked together in lawful ways, then 
adequate theoretical conceptions and methodological techniques must reflect 
these facts. Much of the standard theory and methodology in status attain- 
ment, including the newer developments incorporating structural variables, 
such as characteristics of firms, and life ev2nt history analysis do not yet take 
into account the contingent nature of mobil ty opportunities. This present lack 
of response to the underlying mechanisms cf mobility may unnecessarily limit 
our understanding of the important processes that these approaches investigate 
(see Stewman 1975a,b, 1978, Konda & S:ewman 1980, Stewman & Konda 
1983 for similar points and tests of vacanzy versus other mobility models). 

Vacancy chain models themselves are mot immune to criticism, however 
(White 1970). First, the present Markov models do not treat the actual, 
detailed influences at work in selecting particular individuals for particular 
vacant openings. For humans these influences include reputations, networks 
of acquaintance, information, and having the right attitudes; for animals, 
individual differences in searching behavior, aggressiveness, ability to pro- 
cess sensory stimuli indicating the preserce of vacant resource units, and 
overall physical condition would be important. Second, the Markov models 
presently used are quite crude. Positions are roughly grouped inte a few broad 
strata, probabilities of a vacancy moving between the various states have to be 
measured empirically and cannot be predicted from first principles. In most 
studies the distribution of vacancies entering a system is treated as an ex- 
ogenous variable and not predicted within vacancy chain theory (but see 
Stewman 1988, Weissburg et al 1991 for some ideas about theoretically 
predicting the entrance of vacancies into a system). Third, vacancy chain 
theory, in its present formulation, does not lend itself to answering questions 
about the equality of access of various groups to mobility opportunities, and 
these questions motivate, in part, some of the more traditional approaches to 
mobility (but see the exceptions noted above). 

While meeting these criticisms would further strengthen the present meth- 
odology, vacancy models have already demonstrated remarkable similarities 
in the social organization of several human and animal mobility systems, and 
vacancy chains are the central concept in linking together a wide range of 
social processes. The present successes suggest the future contributions of the 
vacancy conception for understanding the distribution of inequality, career 
development, organizational demography, and mobility processes for humans 
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and animals. I suggest that the most valuable of these future contributions 
would be the extension of vacancy chain theory in two areas: (a) further 
investigations of the common properties of all vacancy chain systems includ- 
ing in particular those topics described above but not yet treated: aggregate 
effects for primary and associated individuals and institutions and the career 
dynamics of individuals and their resource units and social positions, and (b) 
studies of the fundamental, underlying structural properties causing the 
already observed similarities among several human and nonhuman vacancy 


chain systems. 
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Abstract 


This review examines one of the most fundamental issues of family history, 
the nature of domestic groups in which people lived in the past. The focus is 
further limited to the evolution of family forms in Europe. Although such 
models as those originally proposed by Laslett and Hajnal for western family 
history have been shown to be wanting, they have served an invaluable role in 
stimulating and guiding family history research. We are now able to begin to 
grasp the contours of a much more complex western family heritage than 
earlier scholars recognized. 


INTRODUCTION 


Western family history has long intrigued sociological theorists, ranging from 
the evolutionists of the nineteenth century to feminists today. Until recently, 
sociological theorizing on changes in family life in the west was based 
primarily on speculation and folk wisdom. The boom in family history which 
began in the 1960s and gathered steam over the past two decades has provided 
unparalleled opportunities for sociologists to take a new look at family life in 
the past and the forces responsible for molding family organization. Many 
sociologists have indeed dipped into this historical literature in the interest of 
drawing sociological lessons, but their work has been hampered by the speed 
with which new historical evidence is changing previous understandings of 
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family life in the past. As a result, new gereralizations which are historically 
untenable are being introduced into sociology. 

In this review, I examine one of the most fundamental issues of family 
history, the nature of the domestic groups .n which peopie lived in the past. 
Here, focus is further limited to the evolution of family forms in Europe, a 
field of research that has had a major impac- on the development of sociologi- 
cal theory. Of special interest in this conmection are the changes that took 
place with the transition from agricultural to industrial society from the 
eighteenth to the early twentiéth centuries. 

While our focus is on forms of coresidence, it is important to keep in mind 
the complex relationship between coresidential groupings and the workings of 
various kinship principles. Household and family, for example, are not syn- 
onyms, and when the two are confused aralytical and theoretical confusion 
can result. Household refers to the group of coresidents, people who live 
under the same roof and typically share in common consumption. Family is a 
much more ambiguous term; it refers to close kin, but the exact reference of 
the term tends to vary contextually. From an anthropological viewpoint, the 
key distinction is between a domestic grou», on the one hand, and a kinship 
system—with its attendant categories of Lin—on the other. Very different 
kinship systems may have identical domestic groups, or households, and 
similarly, the same kinship system may result in a diversity of households at 
any one time. Kinship systems are themselves, however, related to rules of 
coresidence in rather complex ways. For 2xample, there is a link between 
patrilineal kinship systems and patriloca. postmarital residence systems. 
However, even a knowledge of residence -ules of this sort cannot, in itself, 
lead to a prediction of the frequency of any particular household form. 
Households are not simply the product of residence rules but are also affected 
by demographic, life course, and politica! economic factors, as is detailed 
below. The historical study of family or Ein relations involves much more 
than research on household forms, for kinstip relations are not constrained by 
coresidential boundaries. In this review, however, we limit ourselves to the 
study of household forms; we ask how these vary, and how we can explain 
this variation. 

We concentrate here on research developments over the past decade, 
though we first briefly review the work that, 20 years ago, radically. altered 
traditional sociological ways of looking at family life in the rural past. I argue 
that the understandings that gained curreacy in the 1970s, and that have 
subsequently become widely accepted armong sociologists, can no longer 
withstand historical scrutiny. I show (aJ why those who claim that the 
European preindustrial past was characterized by nuclear family households 
have misled us; (b) that the whole enterorise of branding major areas of 
Europe as having a particular type of housekold system is misleading, and that 
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there are theoretically significant differences in household forms within such 
regions; and (c) that rather than focusing on the limited issue of whether or not 
industrialization entails household nuclearization, we need to address the 
basic question of what determines coresidential arrangements in any time or 
place. In this effort, I suggest, we need to adopt an approach that focuses on 
the interplay of political economy, demography, and culture. 


THE NEW ORTHODOXY 
On Myths Old and New 


Sociologists have long delighted in disabusing us of “myths” of family life in 
the past. In his influential work of three decades ago, Goode (1963 : 239) took 
issue with the image of the "classical family of Western nostalgia," noting 
that in every generation, people think of family life in the past as having been 
wholesome and untroubled in comparison with family life in the present. 

Peter Laslett (1965, 1972), later joined by various British colleagues, 
attacked another “myth” of family life in past time, the notion that European 
society had for centuries been characterized by large, complex family coresi- 
dence. Arguing that, on the contrary, western family life had long been lived 
in nuclear family households, Laslett rejected the previously dominant view 
that industrialization brought about family nuclearization. Indeed, many 
scholars began arguing that causality worked in the opposite direction, that 
the preexisting nuclear family system made Europe fertile ground for the rise 
of modern industry (Greenfield 1961, Wrigley 1977, Lee 1982, Todd 1985, 
Levine 1985). Further turning conventional wisdom on its head, Anderson 
(1972, 1978) maintained that industrialization and urbanization often brought 
about an increase in extended family living arrangements. 

The impact of Laslett’s work on sociology has been impressive, if often 
misinformed. In a recent review of family history, Cherlin (1983:52) cites 
Laslett as his source in concluding that the finding of the universality of the 
nuclear family in preindustrial Europe "is by now so well-known and well- 
established as to require little elaboration." In sociological texts on the family, 
the new orthodoxy now reigns. Goode (1982:95) concludes that in “Western 
societies, for hundreds of years in the past, neolocality has been the rule.” 
Skolnick (1987: 124) assures us that “The myth of the large extended family 
household in times past was laid to rest by Peter Laslett and his associates." 
Similarly, Collins (1988: 120) refers to the “myth” of the extended family in 
the European past, and Duvall & Miller (1985:10) write that "Contrary to 
popular opinion . . . contemporary scholars are quite certain that the majority 
of European . . . families were rarely extended to include nonnuclear rela- 
tives." These examples could be multiplied many times. 
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While Laslett’s early work has been transformed into a new sociological 
orthodoxy, its progenitor has been rapidly retreating from it in the wake of the 
growing body of historical studies in Europe that show a much more complex 
and interesting picture than Laslett’s original formulation suggested. 

Not one, but several different systems of family and household organiza- 
tion characterized preindustrial Europe. The most influential early statement 
of this proposition, albeit in indirect form. came from Hajnal (1965), who 
heralded a European marriage pattern which was unique in the world, and 
which "extended over all of Europe to the west of a line running roughly from 
Leningrad . . . to Trieste" (p. 101). The primcipal features of this system were 
a late age for females at marriage and a large proportion of people who never 
married. Although with understandable A3glo-centrism, Hajnal referred to 
this as the European marriage pattern, his model made clear that another 
marriage system was to be found in eastern Europe. 

More recently, Hajnal (1982) has takea his original argument further, 
distinguishing two kinds of household formation system in preindustrial 
times. In Northwestern Europe, a simple &ousehold system was marked by 
late age at marriage for both men and wormen, neolocality, and a premarital 
life stage in which youths circulated among households as servants. Hajnal 
contrasts this with a joint family household system, found in parts of Asia, 
with earlier marital age for women (under 21 as opposed to over 23 for the 
simple household system), patrilocality, and hence the coresidence of two or 
more component conjugal family units. Note the key role played by female 
age at marriage in Hajnal’s model, for it is tais variable that links demograph- 
ic constraints to household composition. This is based on the proposition that, 
given high mortality rates in preindustrial times, in areas where women marry 
late there is little opportunity for three-generation households to form. Citing 
the insufficiency of evidence outside northwestern Europe, Hajnal makes no 
commitment as to the kind of household system that characterized southern 
and eastern Europe in the past, though he notes the presence of a joint family 
system in fifteenth-century central Italy (Herlihy & Klapisch-Zuber 1985). 
Meanwhile, Richard Smith (1981), influerced by Hajnal's work, expanded 
on this observation to propose a Mediterranzan marriage pattern, with the age 
at marriage for women earlier than that fourd in the northwest of Europe and, 
consequently, higher proportions of complex family households. 

Here we may pause to define some of the key terms used in historical 
household analysis. The nuclear family household consists in its full form of a 
married couple and their children. This is contrasted with the coraplex family 
household, which includes kin beyond the nuclear family. Historians have 


HOUSEHOLD HISTORY 159 


tended to distinguish between two kinds of.complex family households. In the 
stem family form, one child, and one child only, brings his or her spouse into 
the parental household. In the joint family form, more than one child (general- 
ly all the sons) are supposed to bring their spouses into the parental house- 
hold, leading to a coresidential unit that may have more than two component 
conjugal units. Using a somewhat different approach, Hammel & Laslett 
(1974) distinguish between complex family households that are extended and 
those that are multiple. An extended family household consists of one in 
which kin beyond the nuclear family are present, but there is only one nuclear 
family unit in the household (e.g. a nuclear family plus the wife's widowed 
mother). A multiple family household consists of two or more coresiding 
nuclear family units. 

In the face of growing evidence for a diversity of family systems in 
preindustrial Europe, Laslett (1977, 1983) moved away from his earlier 
emphasis on "the European family," identifying instead four different sys- 
tems: (a) northwest; (b) west/central; (c) Mediterranean; and (d) eastern. 
According to Laslett, the nuclear family household dominated in just one of 
these four regions, the northwest, though it was also found in the west/central 
region. On the other hand, neolocality was uncommon in Mediterranean and 
eastern Europe. Following Hajnal's logic, Laslett argued that the nuclear 
family household system of the northwest was propped up by a high female 
age at marriage (see also Wall 1983), and marked by a pattern of premarital 
life-course service. In eastern and Mediterranean Europe, by contrast, low 
female age at marriage and an absence of life-course service accompanied a 
system of complex family households. 

French scholars never joined this revisionist bandwagon (Burguiére 1986, 
Collomp 1988), a fact partly attributable to LePlay's (1982 [1872]) legacy. 
However, what really prevented French family historians from embracing the 
new view was that preindustrial France was characterized by a wide diversity 
of household syste:»is, a fact the French historians could hardly ignore. 


OVERTURNING THE NEW ORTHODOXY 


The Northern European Evidence 


Confronted with the growing evidence against them, proponents of the new 
orthodoxy have attempted to salvage their thesis in a number of ways. Some, 
following Hajnal's earlier efforts, simply exclude the eastern portion of the 
continent, identifying western Europe with Europe in general. One problem 
here, as we shall see, is that southwestern Europe does not fit the nuclear 
family model. This has been dealt with by excluding Iberia, southern France, 
and Italy from western Europe, in a rather odd redefinition of historical 
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geography. The logical outcome of this retreat is what we find in Laslett's 
(1983) new approach, with the nuclear family household system said to 
characterize only northwestern Europe. 

Yet, even if we limit consideration to this portion of the continent, it is by 
no means clear that the nuclear family household system was everywhere 
dominant. We know, for example, that cemplex family household systems 
characterized northern Sweden for centucies (Egerbladh 1989) and stem 
family households were prominent in Irelard (Gibbon & Curtin 1978, Varley 
1983). Jannsens (1986) followed members of a Dutch community over their 
life course in the late nineteenth century ard found that a majority spent part 
of their lives in extended family househelds of some kind. But it is the 
evidence from the rest of Europe that most effectively destroys both the notion 
that preindustrial European family life was based on a nuclear family house- 
hold system and, with it, the thesis that the process of industrialization could 
not have entailed household nuclearizatior. 


The Evidence from Southern Europe 


Throughout the 1970s, historical sociologists regularly lamented the dearth of 
information on southern European family systems. Partly in response to Peter 
Laslett’s stimulus, though, the 1980s witnessed tremendous growth in histor- 
ical family studies in southern Europe, end many studies focused on the 
household. Laslett’s (1983) proposal of a Mediterranean family system, 
characterized by complex family households, young female age at marriage, 
and lack of life-course servants, came in response to some of this work. The 
actual historical evidence, however, points to a much more complex reality. 
We look here at the evidence for Italy, Iberia, and southern France. 

There is now no doubt that complex family households predominated in 
major portions of Italy. Most notable is the sharecropping belt extending 
throughout much of central Italy and into scme of the northern regions. There, 
in many communities of the eighteenth and nineteenth centuries, a majority, 
sometimes a large majority, of people lived in households containing two or 
more component conjugal families (Torti 1981, Doveri 1982, Angeli 1983, 
Barbagli 1984, Kertzer 1984, Kertzer & Bxettell 1987, and Tittarelli (1991). 
This system resulted, in part, from the pover held by landlords, who sought 
families that would improve their return on the land. With sharecropping 
contracts generally renewable at the landlord’s pleasure every year, house- 
holds were under heavy pressure to maximize their adult family workforce or 
risk losing their farms. 

Complex family households were also ccmmonly found elsewhere in north- 
ern Italy, where sharecropping was not practiced. Complex family households 
were thus not limited to a sharecropping economy. Most often, such complex 
family arrangements were found in settings—especially common in the 
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mountains—where people were under ecological pressure to diversify their 
family economy (Ramella 1977, 1983, Mabilia 1980, Sella 1987). A variant 
of this pattern has been examined by Viazzo (1989, Viazzo & Albera 1990) in 
the Western Alps, where multiple family households provided the labor 
power to divide the family work force between agriculture and pastoralism. 

It is not just the existence of non-nuclear household systems that makes 
Italy such an interesting case, but also the diversity of systems found there. 
Indeed, Barbagli (1991) has identified three different household formation 
systems in Italy operating until the early twentieth century. The first, illus- 
trated by the cases discussed above, combines patrilocality and late marriage 
age and characterizes much of central and northern Italy. The second, com- 
bining neolocal residence and early female marriage, predominated in south- 
ern Italy, while the third, combining neolocal residence and late female 
marriage age, typified Sardinia. Da Molin (1990) ties the nuclear family 
system of the south to various economic and ecological forces, including the 
large mass of landless or near-landless laborers, and the tendency for agricul- 
turalists not to live on the land they farm, but in densely settled agrotowns. In 
transitional areas, such as Molise, lying between central and southern Italy, 
rates of complex family households are higher than in the south, but lower 
than in central Italy (Douglass 1980, 1991). 

The Italian evidence undermines Laslett's and Hajnal's newer formulations 
regarding European marriage systems. Hajnal's hypothesis of a connection 
between neolocality and late female marriage age is belied by the fact that it is 
in southern Italy, where neolocality was the norm, that women married 
young, while in central Italy, where joint family households were the norm, 
female marriage age was similar to northwestern European levels. Moreover, 
the tie between a young female marriage age and low proportions of people 
who never married is belied by southern Italy, which had relatively high rates 
of celibacy (Benigno 1989). Finally, the association between neolocality and 
life-course service is undermined by the fact that such service was common in 
the joint family households of sharecroppers in central Italy, but rare in the 
nuclear family households in southern Italy. 

Curiously, in a geographical pattern which both parallels Italy and reflects 
poorly on the notion of a uniform Mediterranean complex family household 
system, nuclear family households in Iberia predominated in the south, with 
more complex family households typifying many parts of the north. Lisón- 
Tolosana (1977) identified a patrilocal stem family system as the norm 
throughout much of northern Portugal, the northwestern Spanish provinces of 
Galicia and Asturias, the Basque country of northeastern Spain, and Aragon 
and Catalonia. Subsequent research has largely borne this out (Douglass 
1988a,b, Flaquer 1986, Comas d' Argemir 1988). The stem family household 
is linked to an impartible inheritance system, found in the north, which 
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contrasts with the partible inheritance of the south. Moreover, the south was 
characterized by large landholdings and many landless farm laborers (Reher 
1988, Carrion 1988, Bastos 1988). However, even in the nuclear family 
household system of the south, newlyweds »ften began marital life coresiding 
briefly with the parents of either bride or groom (Reher 1987). 

Like Italy, Spain, and Portugal, France too had a diversity of household 
systems in the preindustrial past. As a gereral rule, neolocality and nuclear 
family households, linked to partible inheritance, characterized the north, 
while much of the south, with its tradition of impartible inheritance, had 
complex family household systems. Varioas forms of the stem family were 
prevalent in different parts of the south. In general, a single son (typically, but 
not always, the oldest) remained in the parental household after marriage, 
inheriting the farm (Fauve-Chamoux 1987, Collomp 1988). In some parts of 
the south, however, a daughter could inhezit, bringing in her husband (Fine 
1988, Lebrun 1988). 

In the sharecropping areas of central France, on the other hand, as in 
sharecropping Italy, joint family householcs were common. Here, there was 
no limit to the number of sons who could »ring their wives into the parental 
household, and households could becoms considerably larger than those 
found in stem family areas (Shaffer 1982, Darrow 1989; Bideau et al 1986). 
Finally, while nuclear family organization predominated in the north, varia- 
tions on the pattern can easily be found. For example, Fine (1988:407—408) 
discusses the practice in Breton of each child, in turn, bringing his or her 
spouse into the parental household upon marriage. The newlyweds were 
displaced upon the marriage of the next sbling. 


The Evidence from Eastern Europe 


Although eastern European family history -emains less well known, we now 
do know that a diversity of household forms prevailed there in preindustrial 
times. It is symptomatic of the expansion m eastern European family history 
that while Serbia was the only eastern Eurcpean area represented in Laslett & 
Wall’s (1972) landmark collection, the more recent Cambridge Group collec- 
tion (Wall et al 1983) contained chapters on five different east European 
societies. 

The paradigmatic east European case is described by Czap (1978, 1982, 
1983) who examines Russian serf society in the late eighteenth and early 
nineteenth century. The Russian case belies the common sociological claim 
that, even where patrilocality is the cultural] norm, demographic constraints 
would prevent people from spending much of their life in complex family 
households. Indeed, Czap describes a system of "perennial" complex family 
households, in which most people spent tre majority of their lives in house- 
holds containing at least two simple, patrilaterally related family units. 
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Women married early and, in part due to landlord pressure, marriage did not 
involve founding a new household, but rather the expansion of a pre-existing 
one. 

Although Czap proposed this as a model for an eastern European household 
type, this is not the only kind of household system found in eastern Europe, a 
point made earlier by Sklar (1974). While accepting multiple family house- 
holds as a regional characteristic of eastern Europe, Plakans (1987) notes that 
in many areas household complexity was, in fact, only found at a particular 
phase of family development. Strikingly, different communities within the 
same areas could vary substantially in the prevalence of complex family 
households (Plakans 1983). Thus, Palli (1983) finds complex family house- 
holds common in southern Estonia but nuclear family households typical of 
northern Estonia. The most thorough study in eastern Europe of this kind of 
variability has been carried out by Andorka and colleagues (Andorka & 
Faragó 1983, Andorka & Balázs Kovács 1986) on preindustrial Hungary. 
They find that complex households typified areas that had serf-peasant pop- 
ulations, while nuclear family households were more typical of communities 
of landless cotters. 

What all this adds up to is that eastern Europe, like western Europe, 
displayed a diversity of househọld systems in preindustrial times, and these 
were linked in part to regional differences in political economic arrangements 
and ecological conditions (Mitterauer & Kagan 1982). Many of the prevailing 
generalizations about the “myths” of European family history turn out to be 
themselves ill-informed and lead to a dead end in construction of sociological 
theory. Beginning with this understanding of the actual diversity of family 
systems in the European past, we can reconsider just what forces may be 
responsible for producing the household systems that shaped western family 
. life. The following sections are devoted to two of these forces: economic 
factors, and the influence of demography. 


THE ROLE OF ECONOMIC FORCES IN DETERMINING 
HOUSEHOLD SYSTEMS 


The Household as Labor Force 


The traditional model of rural life in preindustrial times depicts a society in 
which peasant households were the unit of production. Indeed, the pre- 
industrial household is often contrasted in this regard with the modern house- 
hold, the latter being a unit of consumption, but not of production. Historical 
work over the past two decades has overturned this simple dichotomy, with 
the growing realization that a good deal of preindustrial agriculture involved 
landless laborers for whom the household was not a unit of production. 
However, since preindustrial households often did serve as units of produc- 
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tion, it is important to ask in what ways household composition may be 
explained by labor force requirements of the household as an economic entity. 
Here Chayanov's (1986) work has been infliential, with its insistence that the 
peasant family follows its own, noncapitalist logic (see Thorner 1986). 

In a household-as-labor-unit model, the composition of the household is a 
product of the labor demands of the econorric operation, whether it be a farm 
or a protoindustrial home workshop. A good example is provided by attempts 
to understand the diversity of household forms in the Alps, where complex 
family household systems predominated in zome areas and nuclear systems in 
others. Viazzo (1989) emphasizes the delicate balance between farming and 
herding in Alpine communities, along wita the need to scatter agricultural 
plots in different microclimatic locations to minimize risks. He concludes that 
the vulnerability of a household that cannct provide a labor force sufficient 
both for herding and for farming in scattered fields places heavy pressure on 
people to come together in complex households. How, then, can Alpine 
villages typified by nuclear family coresiGence be explained? In the Swiss 
community of Tórbel, studied by Nettimg (1981), the same economic/ 
ecological pressures are dealt with differently, demonstrating the complex 
interplay of economy, ecology, and culture. Rather than each household 
having to provide a workforce sufficient fo- both farming and herding, herds 
are tended communally, and nuclear famify households are thus viable. 

A parallel example comes from Sardin:a, where recent research (Oppo 
1990) reveals that two residential systems co-existed. A nuclear family 
system prevailed in the cereal-producing villages of the coastal plains, while 
in the mountains, residence was organized around uxorilocal principles. 
Husbands either moved into their wives' natal home or, more commonly, a 
cluster of homes took shape, organized around a group of married sisters and 
their mother. The explanation for this pattern is that, in this pastoral economy, 
women remained in the villages, tending tne family's local holdings, while 
the men spent long stretches of time away ir the mountains herding the flocks. 
In such circumstances, cooperative groups ef kin-related women provided the 
backbone of village economic and social organization. - 


Political Economic Forces 


Family labor force requirements, however, must be placed in a broader 
political economic framework. The influence of such political economic 
forces is perhaps most apparent in easterr Europe, where not so long ago 
serfdom was widespread and the manorial lords controlled the land on which 
people depended. In this setting, the rules zoverning household composition 
were largely determined by the lords, based on their calculation of what 
would best serve their economic and political interests. The complex nature of 
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the Russian serf household, Czap (1982, 1983) tells us, was due to the fact 
that the Jords who controlled the estates discouraged the serfs from dividing 
their households. Plakans (1983, 1984) finds a similar pattern in the Baltic, 
where the serf estate system imposed the rules governing coresidence. With 
the demise of serfdom in the nineteenth century, the percentage of complex 
family households began to decline. 

In Poland (Kochanowicz 1983), where labor was scarce, lords forbade their 
serfs from leaving the village in which they lived, and tried to encourage 
marriages within the estate. On the other hand, because the serfs’ labor 
obligations to the lord were based on household units, the Polish lords 
encouraged neolocality, often against the wishes of the peasants who saw 
advantages in larger, complex family households which would have a lower 
proportion of their productivity taken from them by the lords. As this case 
shows, peasants often struggled against the pressures of landowning and 
political elites, and peasant-household forms must be understood not simply 
as outcomes of elite interests, but also as the product of attempts to circum- 
vent these pressures. Thus, for example, in parts of what is now Czechoslova- 
kia, the conscription of unmarried men led serf families to encourage the early 
marriage of their sons, which led, in turn, to a higher proportion of complex 
family households (Gaunt 1983). 


Inheritance Systems 


Insofar as peasants owned their own farmland, their household strategies were 
directly.affected by state rules governing inheritance, and they were vulner- 
able to changes in inheritance laws which they were in no position to 
influence. France provides the most dramatic instance of such change, with 
the imposition at the beginning of the nineteenth century of the new Civil 
Code, banning impartible (indivisible) inheritance. Impartible inheritance had 
been a pillar of the stem family system found in much of southern France. It 
gave parents the means to pressure a child into remaining with them in the 
household after marriage and gave the child the incentive to do so. Through 
much of the nineteenth century, peasants struggled to find ways to get around 
the new legal requirements, but the effect of the new law was the progressive 
decline in complex family households (Collomp 1988, Fine 1988, Lebrun 
1988, Darrow 1989). 

The diversity of inheritance systems across Europe, together with the link 
between inheritance and household systems, offers an obvious basis for 
explanations of the geographical distribution of different European household 
systems (Gaunt 1987). At the most general level, impartible inheritance 
systems have been linked to stem family households, with partible inheritance 
linked to nuclear family households. Thus, in explaining the stem family 
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system that prevailed in Iceland in the 170Cs, Egerbladh (1989) points to the 
prevailing system of impartible inheritance. Likewise, the emphasis on im- 
partible inheritance in northern Spain has commonly been cited as an explana- 
tion for the presence of stem family househclds in the north, as opposed to the 
nuclear family arrangements that prevailed n the south (Flaquer 1986, Carri- 
on 1988, Reher 1988). 

Not all stem families are organized in th» same way, however; one of the 
major distinctions among them regards the t.ming of transmission of authority 
over the property. In some German-speakimg regions, the heir took over the 
headship of the household upon his marriage, with the parents effectively 
retiring. In southern France, by contrast, the father typically retained control 
over the household until his death (Collomp 1988, Plakans 1989, Sgrensen 
1989). 

To understand the role of inheritance zystems in explaining household 
dynamics in the past, it is important to realize that not all segments of the rural 
population had land they could pass on to their children. We have already 
seen that under serfdom—prevailing in parts of eastern Europe until the 
nineteenth century—it was not inheritance but rights to land obtained by 
relationships with lords that influenced hovsehold formation systems. In the 
Baltic in the late eighteenth century, for exemple, hereditary serfs had certain 
rights to land which could be passed on, anc they thus lived in large, complex 
family households. However, a substantial »art of the population lacked such 
rights; they had little to pass on and lived in nuclear family households 
(Plakans 1983). 

In western Europe, too, an important segraent of the agricultural population 
had rights to land without owning it, and thas their household dynamics were 
not directly tied to inheritance in the usual sense. These were the sharecrop- 
pers, found especially in central and parts of northern Italy and in parts of 
central and southern France. In both countries, sharecroppers were associated 
with joint family households (Lebrun 19€8, Darrow 1989, Kertzer 1977, 
1984, Kertzer & Hogan 1988). This system was unaffected by inheritance 
laws; thus, for example, the above-mentiored changes in French inheritance 
law, which undermined the stem-family sys‘em of the peasant proprietors, left 
the sharecroppers unaffected. Their complex family households were rooted 
not in inheritance but in the political econcmy of sharecropping. As long as 
sharecropping continued, so would the joint family households (Shaffer 
1982). 


Economic Differentiation 


The close link between economic arrangements and household composition 
has implications not only for explaining differences among regions in Europe, 
but also for understanding the diverse systems that coexisted within local 
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populations. Full appreciation of the importance of this differentiation has 
been hampered by the tendency of many scholars to identify whole areas (e.g. 
the Mediterranean, or even central Italy) by a single household system 
(Kertzer 1985). As Anderson (1980: 32—33) put it, “even in peasant societies 
by no means everyone has ever been a peasant." 

Research on protoindustry—the putting-out system of manufacture that 
turned rural homes into workshops throughout major areas of Europe from the 
sixteenth to the eighteenth centuries—has emphasized one such nonpeasant 
population (Mendels 1972, Medick 1976). Aside from the attention they have 
paid to such protoindustrial populations, though, scholars have been so taken 
by the idea of a European countryside populated by peasant farmers that they 
have often failed to recognize the sometimes large populations of landless 
agricultural laborers. 

Studies that distinguish within individual communities between landed 
peasants and landless workers (agricultural or not) have consistently found the 
latter to be characterized by much lower levels of complex family coresidence 
(Comas d’Argemir 1988, Shaffer 1982, Darrow 1989, Sella 1987, Angeli & 
Bellettini 1979, Kertzer & Hogan 1989, Andorka & Balázs-Kovács 1986). 
Landless workers typically lived in tiny, cramped quarters that they had to 
rent. Because adults sold their labor on the market as individuals, they did not 
face the kinds of pressures favoring complex family coresidence felt by those 
for whom the household was the unit of production. 

In societies where peasants owned their owri land, a positive relationship 
generally exists between the size of the holdings and the complexity of the 
household (Brettell 1988, Egerbladh 1989). This should not, however, be 
simply translated into an equation of greater wealth with greater household 
complexity, as some scholars have been wont to do (e.g. Ruggles 1987:39). 
In sharecropping Italy, for example, sharecroppers had larger and more 
complex households than those who owned the farms. Even the commonly 
held assertion that the poor lived in small households has recently been 
reexamined. Sokoll (1987) argues that many of the studies that have made this 
claim are methodologically flawed. In his own study of a late eighteenth- 
century English community, Sokoll finds that pauper households were 
larger than others, because they contained more children, and more com- 
plex than others, in part because they often contained an elderly widowed 
parent. 

One inference we can draw here is that poverty was connected to certain 
points in the life course. Accordingly, it may be misleading to think in terms 
of a distinguishable segment of poor families with its own household system. 
(On the importance of life course perspectives in historical household study, 
see Elder 1978, 1987, Vinovskis 1977, 1988, Kertzer 1986, and Hareven 
1987.) 
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How Nuclear Were Nuclear Family Households? 


Even where stem family norms are widey shared, people may spend a 
significant proportion of their lives living in nuclear family households (Berk- 
ner 1972). This is attributable to life course patterns, since once the older 
generation dies off, and before the members of the third generation can marry, 
the household will be nuclear. What is not commonly recognized, however, is 
that so-called nuclear family systems often eatail spending part of one's life in 
a complex family household. There are a namber of reasons why this is the 
case, but probably the single most important diagnostic element involves how 
the elderly are treated (Laslett 1988). 

Wall (1989 : 374), part of the Cambridge Group which has championed the 
notion that complex family arrangements were rare in northwestern Europe, 
notes the “curious paradox" that while preindustrial English households were 
“overwhelmingly simple," “the majority or older residents still reside with 
relatives." He largely attributes this to the fact that, in the past, because 
parents bore children until relatively late in -heir lives, the elderly were more 
likely to have young unmarried children still living with them. 

However, the weight of the evidence from populations with nuclear family 
household systems suggests that widowed parents were often taken in by one 
of their married children. For example, Darrow (1989) describes the eight- 
eenth-century French case where workers, who generally lived in nuclear 
family households, regularly took in elderly relatives. Similarly, Jannsens 
(1986) actually followed adults through them life course in Tilburg, Holland, 
in the nineteenth century, and found that a large majority spent part of their 
adult life in a complex family household due to the taking in of kin. Old folks 
there rarely lived alone, being taken in eithe- by a married child or by another 
relative. It is notable that Wall (1986:26C) himself finds that in Colyton, 
England, in the mid-nineteenth century, 22% of the farmers’ households 
contained a relative beyond the nuclear family. 

The boundaries of nuclear family households were not nearly so imperme- 
able as we have been led to believe, and kin often flowed in and out, as the 
need arose. Indeed, if we focus on the treatment of the elderly, it may be 
possible to distinguish between two types of nuclear family household sys- 
tems: one in which, upon the death of a spoase, the widow(er) remains living 
alone; and one in which, upon the spouse's death, the widow(er) coresides 
with a married child. 


The Impact of Industrialization 


The new orthodoxy neatly disposed of tbe problem of the impact of in- 
dustrialization on household systems, for if nuclear family houscholds char- 
acterized the preindustrial past, the old thesis of industrialization bringing 
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about household nuclearization could hardly be sustained. Indeed, discussion 
shifted to the opposite perspective, one which argued that industrialization 
and urbanization brought about an increase in extended family living arrange- 
ments. A number of arguments were offered to support this revisionist thesis, 
including that grandparents were needed to take care of small children and the 
household when both of the parents had to be away at work, and that extended 
kin were useful in facilitating the integration of new immigrants to the city 
(Anderson 1971, 1978, Wrigley 1977). 

For the past decade and a half, in fact, the most active area of historical 
research concerning households and industrial work has concerned not the 
impact of modern industrialization, associated primarily with developments in 
the nineteenth century, but rather the implications of an earlier economy, that 
associated with protoindustrialization. In line with the theory that household 
composition is largely the product of economic arrangements, scholars have 
argued that the protoindustrial household had distinctive features based on its 
. relationship to the means of production. 

According to this view, for the protoindustrial laborers as for the peasant 
farmers, the household was the unit of production; all members took part in 
the family enterprise. However, unlike the prototypical peasant family, de- 
pendent on inheritance of the family farm, the protoindustrial workers needed 
little to create their own firm. The result was that parents had less control over 
their children as they grew to adulthood, and there were fewer impediments 
toward marriage for children. Unlike other landless rural dwellers—who in 
many areas sent their children off at a tender age to serve as servants or 
apprentices in other households—the protoindustrial families kept their chil- 
dren at home as long as they could, since their labor could be productively 
used in the family firm (Medick 1976, Levine 1977). 

More recent studies have taken issue with some elements of this pro- 
toindustrialization theory (Gutmann 1987). Some have questioned whether 
protoindustrial households did exhibit distinctive demographic characteristics 
(Houston & Snell 1984, Poos 1986). Of particular importance is the conclu- 
sion found in the more recent literature that in many, perhaps most, areas of 
Europe, there was no separate category of protoindustrial households. Rather, 
a wide variety of both landless and peasant households took up protoindustrial 
labor. 

For example, Pfister's (1989) study of a rural portion of the canton of 
Zurich in 1762 finds that much of the protoindustrial labor was done in 
farming households, and that farming households with protoindustrial activity 
were richer than those without. He argues that protoindustrial activities were 
entered into when the household was at a phase of complex family composi- 
tion, allowing excess labor to be productively occupied in the agricultural 
off-season and generating cash income which could be used to invest in farm 
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improvements. This analysis complements Barbagli's conclusion, based on a 
study of the impact of industrialization on households in Italy, that the advent 
of industrial labor did not necessarily undecmine complex family household 
arrangements, since households could sometimes benefit from them in mixing 
their investments in both the agricultural and industrial sectors (Barbagli 
1984: 107). 

Research on the impact of modern industrialization on household arrange- 
ments in Europe has concentrated almos. entirely on England. There is 
considerable evidence there that modern industrial development and the urban 
expansion that went with it often led to a higner degree of household complex- 
ity, at least until the modern welfare state could develop alternatives to kin 
dependence. However, England cannot be taken as typical of Europe as a 
whole, and thus the old question of the impact of industrialization on house- 
hold arrangements remains unanswered. Ir areas of Europe where complex 
family households prevailed in the countrys.de, the increase in the proletarian 
proportion of the population meant an increase in nuclear family households. 
This does not mean that modern industrialization necessarily entailed the 
advent of a new household system in these areas, since other proletarianized 
segments of the rural population (such as agricultural wage laborers) often 
predated modern industry, and typically lived in nuclear family households 
(Kertzer 1978, Lehning 1980, Accampo 1989). 


THE ROLE OF DEMOGRAPHIC FORCES IN 
DETERMINING HOUSEHOLD COMPOSITION 


The Relevance of Demography 


As we have seen, household arrangements are in good part the product of 
political economic forces, and they also reflect cultural norms regarding 
where one should go to live upon marriage, what should be done with aged 
parents, and so on. But households are also the product of demographic forces 
that affect what kin are available for coresidence. Patrilocal postmarital 
residence, for example, is not necessarily found just because the prevailing 
` economic system and cultural norms make patrilocality desirable: If a man's 
parents have died before he marries, patrilocality (at least in a strict sense) is 
impossible. Likewise, under certain demographic conditions, such as various 
migration patterns, new forms of complex family coresidence may become 
common, despite the Jack of cultural norms favoring such coresidence (Bret- 
tell 1986, Carrion 1988). 

Demographic explanations of household systems first became important in 
family history in the wake of claims tkat, even where complex family 
households are the cultural norm and are am optimal adaptation to land tenure 
arrangements, most households most of the time would be nuclear (Berkner 
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1972, 1975, Wheaton 1975). These claims, in turn, were based on ‘earlier 
work by Levy (1965), who argued that high mortality rates prevailing in the 
past prevented large numbers of complex family households from developing, 
even in societies which favored such arrangements. This view has itself 
become part of the new orthodoxy, as expressed, for example, by Goode 
(1982:108), who maintains that in the past high mortality rates "made it 
inevitable that most families would not be able to create and maintain a large 
household." Likewise, Ruggles (1987 :61) cites Levy in making the claim that 
"the frequency of extended families was low in the preindustrial period." He 
goes on to argue that patterns of household composition in the past are in good 
part "simply reflections of variation in demographic conditions during the past 
four centuries" (Ruggles 1990:22). 


Microsimulation 


As family historians have turned to demography for help in sorting out the 
effects of demographic forces on household formation, they have encountered 
demographers who have themselves been struggling with the complexities of 
examining household units (Bongaarts 1987, Clarke 1986, Keilman 1988, 
Willekins 1988, Watkins et al 1987). A variety of modeling procedures have 
been proposed in dealing with these questions, but in recent years 
microsimulation has received the most attention. Through microsimulation, 
the impact of varying demographic rates (e.g. mortality, age at marriage) on 
the distribution of household types can be tested, at least in theory. Thus, for 
example, one can ask what proportion of households would be complex at any 
given time if certain stem family rules were followed, under demographic 
rates prevailing in eighteenth-century England. Moreover, one can ask how 
much impact a change of a specified amount in any demographic rate would 
have on this proportion. 

Indeed, much of the early work in household microsimulation has been 
done on just this problem. Wachter, Hammel & Laslett (1978, Wachter 1987) 
tested Levy's demographic constraint thesis by asking whether such con- 
straints could account for the low proportion of complex family households in 
preindustrial England. They concluded that demographic forces did not pre- 
clude higher proportions of complex family households, and thus there must 
have been other pressures—economic or cultural—which favored a nuclear 
family household system. d 

Yet the notion of severe demographic constraints has been hard to kill. In 
an influential recent book, Ruggles (1987) resurrects Levy's hypothesis, 
using a different microsimulation model to reach conclusions diametrically 
opposed to Wachter et al. Ruggles (1987 : xviii) asserts that "there were few 
extended families before the industrial revolution primarily because most 
people had a shortage of living relatives." The microsimulation results, he 
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argues, show that a large majority of those in preindustrial England “who 
could have resided in stem families act-ally did so" (p. 121), and that 
therefore the stem family household was the preferred family form. The 
implications of this analysis are far-reacking, for Ruggles concludes that 
historical changes in household structure gre due to changing demographic 
rates rather than changing economic pressures. Predictably, members of the 
Cambridge Group have found Ruggles' analysis to be flawed (Smith 1989). 

That such contradictory conclusions can be reached through microsimula- 
tion methods has led some to be wary of this approach to solving the problem 
of demographic constraints (King 1990). The technical sophistication re- 
quired to evaluate competing microsimula:ion models, and the fact that an 
informed evaluation would necessitate examination of the actual algorithms 
used, puts the user of such results at a great disadvantage. It is also easy to 
misinterpret the findings, for they are based on precise models cf household 
formation rules that may have little relationship to any particular historical 
setting. 

As we have seen, for example, stem family rules were not the only ones 
that led to the formation of complex households in the past, and thus any 
attempt to conclude from models based or such stem family principles that 
most households in the past must have been nuclear is logically flawed. Both 
Ruggles (1987) and Wachter (1987:223; have been guilty of this leap. 
Kertzer (1989), for example, has shown that the joint family household 
formation rules prevalent in sharecropping Italy led to high proportions of 
complex family households; he disproves the common assertion that such 
high proportions of multinuclear households could only be found where 
female marriage age was low, as in Russia. The existence of high proportions 
of complex family households depended no: simply on prevailing demograph- 
ic rates, but on cultural norms regarding pcstmarital residence and the princi- 
ples governing household fissioning. These, in turn, are closely tied to 
prevailing political economic forces. This is not to argue that demographic 
constraints play no role. However, the claim that such constraints are more 
influential than economic or cultural forces cannot be sustained. 


THEORETICAL IMPLICATIONS OF HISTORICAL 
HOUSEHOLD STUDY 


The burst of quantitative studies in European family history in the 1970s 
initially seemed to herald the end of the cid wisdom regarding the complex 
nature of family forms in the western past, and, with it, a rejection of earlier 
views that equated industrialization with 3ousehold nuclearization. The re- 
search of the 1980s, however, has undermined this new orthodoxy, with 
evidence of household complexity pourirg in from much of preindustrial 


HOUSEHOLD HISTORY 173 


Europe. This new evidence has been piling up faster than its theoretical 
implications can be assimilated, and we now have no new theoretical para- 
digm comparable either to the older modernization model or the newer model 
of the “myth” of the complex family household in the European past. 

The theoretical implications of the debates we have here chronicled go 
beyond an understanding of coresidential arrangements themselves, for they 
figure centrally in the efforts to understand the remarkable economic and 
political developments in the west during the eighteenth and nineteenth 
centuries. In short, what was it about the western societies that led to modern 
industrialization and the growth of modern forms of capitalism and, even, 
parliamentary democracy? In trying to determine what might have been 
distinctive about these societies, and what element may have been conducive 
to the accumulation of capital and the necessary mobility of labor, scholars 
have often turned to the presumed "individualism" of western culture (Mac- 
farlane 1979, 1987), epitomized by nuclear family coresidence, together with 
the related feature of late age at marriage (especially for women). Indeed, 
Todd (1985, 1987) has gone so far as to argue that nuclear family coresidence 
lay behind the evolution of political democracy, while multiple family house- 
hold systems produced Communist rule. What the studies discussed in this 
review demonstrate, however, is that before sociologists go too far in theoriz- 
ing the role of household systems or related systems of marriage in economic 
and political development, they must recognize the complexity of the distribu- 
tion of these systems, and the factors that affect this distribution. 

Studies of European family life have clearly become more theoretically 
sophisticated over the past decade. Debates on the relationship between the 
individual and the family or household group, often linked to concerns over 
the use of the concept of "family strategies," have shed light on household 
processes as the outcomes of complex negotiations among partially conflict- 
ing perspectives of individual family members, with both gender and 
age acting as important lines of conflict (Moch et al 1987). Laslett (1984, 
1988), in this regard, has written of the family as a "knot of individual 
interests." 

Meanwhile, earlier, crude notions of family life cycles have been thrown 
out, as greater sophistication in interpreting the complex relationships be- 
tween family relations and historical change has been made possible through 
the development of life course perspectives (Elder 1981, Kertzer & Schiaffino 
1983, Kertzer 1986, Alter 1988, Vinovskis 1988). Moreover, households are 
increasingly being seen not simply as isolated social units, but as a key means 
by which individuals cope with the economic, political, and other pressures 
that confront them from the larger society (Rapp et al 1983). The relationship 
of households to larger kin networks has also been receiving more attention 
(Levi 1990). Yet, questions as to the “emergent” properties of the household 
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remain; households are not simply reducible to the sum of their parts (Netting 
et al 1984). 

As we struggle toward a satisfactory explanation for the diversity of 
western household systems, three elements merit particular attention: (a) 
political economic forces; (b) demographic forces; and (c) culture. As sug- 
gested in this review, political economic >ressures may ultimately explain 
much of the diversity of household systems found in preindustrial Europe, 
ranging from the serf households and landless cotters of eastern Europe, to the 
sharecroppers and land-poor peasants of Italy, to the peasant proprietors of 
southern and central Europe. Demographic forces, while conditioning the 
kinds of coresidential arrangements that are possible and influencing life 
course patterns of coresidence, do not explain the differences in household 
systems found in preindustrial Europe. This follows from the observation that 
differences in household forms do not correspond neatly with differences in 
demographic rates. Moreover, the most c-ucial “demographic” variable of 
them all, age at marriage, is better viewed az the product of political economic 
arrangements and cultural norms. 

The relationship between political economy and household composition is 
mediated by cultural norms, and these caa have an independent influence 
(Reher 1988). For example, in sharecropping areas of Italy, the agricultural 
wage laborers, while not having nearly tke proportion of complex family 
households as the sharecroppers, lived much more frequently in such house- 
holds than did English agricultural wage laborers (Kertzer 1984). Laslett 
(1983:547—549), in this connection, has drawn attention to the distinction 
between settings in which the household is a unit of production (a "work 
group”), and those in which it is not. Where households are not work groups, 
he argues, their composition is not determined by the logic of the productive 
unit; they are thus free to take on "something of the form and structure 
normative to the society which surrounds -hem." Similarly, existing house- 
hold systems have a kind of cultural momentum. A good example is provided 
by the industrialization that took place around the Spanish town of Bresalü. 
People did not abandon overnight their attechment to the stem-family norms 
that had long been associated with peasant agriculture. Rather, in this and in 
other cases, people used the preexisting stem family household system as a 
resource in adapting to the demands of a changed economic situation (Flaquer 
1986). 

We are now able to begin to grasp the contours of a much more complex 
western family heritage than earlier scholars had recognized. Although such 
models as those originally proposed by Laslett and Hajnal for western family 
history have been shown to be wanting, the; have served an invaluable role in 
stimulating and guiding family history research. Further advances in this field 
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depend on the development of new theoretical constructs; I have tried here to 
sketch out what some of the building blocks of such theories may be. 
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Abstract 


This paper reviews recent developments in the analysis of business power and 
the corporate elite, focussing on those approaches rooted in the techniques of 
social network analysis. A typology of research strategies is outlined, and this 
is illustrated through discussions of North American studies. It is argued that 
results from these studies cannot be generalized across all societies. This is 
illustrated with reference to European and Asian studies, where a number of 
variant patterns of corporate development can be discerned. Particular atten- 
tion is given to Japan and the development of its pattern of corporate organiza- 
tion, which contrasts sharply with the dominant Anglo-American pattern. 


INTRODUCTION 


The business enterprise has for long been an important topic for sociological 
and, of course, economic investigation. The reasons for this are clear: Its 
organization and operation are central to the structure of a capitalist industrial 
society, forming the basis of both the class structure and wider power 
relations. The need to study the capitalist business enterprise has been un- 
derlined by the recent political changes that swept across Eastern Europe. 
These changes have led to more enterprise autonomy and a greater scope for 
property and market relations in structuring the formerly planned economies. 

Research in this area has largely been structured by a debate over the 
“managerial revolution,” a debate concerned with changing patterns of 
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ownership and control in the modern en-erprise and their implications for 
business behavior. If the enterprise changes, patterns of class and power will 
change. (Berle & Means 1932, Scott 1985'. But attention has recently shifted 
from the structure of individual enterprises to the social relations that exist 
between enterprises. This newer research 2mphasizes the social networks in 
which enterprises are embedded, and the importance of viewing these net- 
works as arenas of power. Beginning with “he path-breaking articles of Zeitlin 
(1974) and Benson (1975), interest has grown in the study of in- 
terorganizational relations (Nystrom & Sterbuck 1981, Aldrich 1979, Meyer 
& Scott 1983). 

The study of interlocking directorships is at the center of this reorientation. 
An interlocking directorship exists when a particular individual sits on two or 
more corporate boards. The boards of large enterprises include both internal 
executives and outside non-executives anong their members. The outside 
directors include a number of public and political figures, but most are 
recruited from other companies and especially from the banking, insurance, 
and investment sectors. Some outside directors will hold two, three, or more 
separate directorships in large companies Many of these people hold only 
outside directorships, but others are internal executive officers (generally 
presidents or chief executives) of companies in which they have their princi- 
pal business interest. Their directorships spread throughout the economy, and 
they form a corporate or business elite, an "inner circle" of corporate de- 
cision-makers with power and influence ac-oss the business system as a whole 
(Useem 1984). 

Much can be learned from investigatirg the social background, recruit- 
ment, and outlook of this inner circle. 3ut research has given increased 
attention to the structure of the network of connections that they generate 
among enterprises. The presence of the same director on two or more boards 
creates a social relation between the two enterprises, and the simultaneous 
occupancy of multiple company boards by the members of the inner circle 
creates a complex web of social relations. The significance attached to such 
relations must, of course, be assessed, bu: it is clear that, at its weakest, an 
interlock involves a potential for communication and for the exercise of 
influence and power. Some attempts to assess the significance of interlocks 
have been carried out through the investigation of the reestablishment of ties 
that are broken on the death or retirement 3f directors (Ornstein 1982, Palm- 
er 1983a, b, Stearns & Mizruchi 1986). The web of interlocking director- 
ships has been at the center of attention ia studies of networks of corporate 
power, though the frontiers of this research have increasingly been pushed 
forward by the consideration of such phenomena as intercorporate share- 
holdings. 

Investigation of intercorporate networks in business has a long history in 
the United States, dating from Moody’s _ournalistic investigations into the 
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trusts and the official Pujo inquiry into the Money Trust of New York 
bankers. But it was only during the 1970s that researchers began to draw on 
technical developments in the methodology of social network analysis to 
model business relations. This offered powerful new models of corporate 
structure. Formal models of intercorporate relations can be constructed in 
which enterprises are represented by "points" and the ties between them by 
“lines.” Such models can easily be visualized and, in the case of small 
networks, can be drawn in diagramatic form. Graph theory, one of the most 
powerful of the formal theories used in social network analysis, allows the 
deeper structural properties of these models to be analyzed mathematically 
(Harary et al 1965, Berkowitz 1982). Through the 1970s and 1980s, a 
considerable research base has developed . A large body of work has also now 
developed outside of North America as researchers have taken up and elabo- 
rated the theoretical ideas that emerged in descriptions and explanations of the 
American situation. 

Research has, however, remained largely at the descriptive level, though 
some important departures from a purely descriptive approach are reviewed 
later in this paper. Descriptive accounts portray actual patterns of in- 
tercorporate relations at particular periods and in particular places, but only 
recently has any sustained move been made toward the more analytical and 
explanatory concerns. Moving away from the relatively formal treatment of 
corporate networks, researchers have undertaken analyses that place in- 
tercorporate relations in their concrete historical and cultural contexts. In this 
way, the analysis of networks of corporate power moved into the mainstream 
of historical sociology. North American research has focussed on the organ- 
ization of business activity in relatively homogeneous economic, cultural, and 
political terms. Thus, many researchers proceed as if their conclusions could 
be generalized across all societies. The descriptions and explanations de- 
veloped in these studies may not, however, apply outside this context. 
European and Asian businesses, for example, show important divergencies 
from the American pattern, and different structures of intercorporate relations 
have been found by those investigating these societies. This review aims to 
update the previous reviews of American research (Useem 1980, Glasberg & 
Schwartz 1983) and to present an indication of some recent developments in 
research on Europe and Asia. Research on East Asia is given special attention 
because this highlights the central concerns of current research on networks of 
corporate power. 


RESEARCH STRATEGIES FOR NETWORKS OF 
CORPORATE POWER 


Any kind of power relation is difficult to conceptualize (Clegg 1988) and no 
less difficult to observe. For this reason, research on networks of corporate 
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power has assumed that interlocking directorships, which are relatively easy 
to observe, can be treated as indicators cr traces of other corporate power 
relations. There has, however, been a lacs of clarity about both the type of 
power relation of interest and the unit of analysis used. 

Interlocking directorships, for example can be studied as interesting and 
significant power relations in their own rigat or as indicators of, say, relations 
of share ownership. In the latter case, independent evidence must be produced 
to support the claimed association betweea directorships and shareholdings. 
Very often, however, interlocks have been assumed to be unproblematic 
traces of shareholding relations. Similarly, the focus of interlock studies may 
be on the formation of bilateral linkages between particular boards, or on the 
overall pattern of connection among the directors. Where both the type of 
relation and the unit of analysis are inadequately specified, the results of 
research must be treated with great caution. 

Power in intercorporate networks is based on at least three distinct kinds of 
intercorporate relation: personal, capital, and commercial. Personal and capi- 
tal relations can, together, be seen as tke principal control relations that 
surround enterprises. Personal relations are those links between agents that 
involve direct connections between peopie, or that involve the sharing or 
exchange of personnel. The most important types of personal relations in 
business are interlocking directorships and the kinship relations among the 
individuals involved in various corporations. Also of importance are such 
personal relations as those of friendship, acquaintance, and neighboring. 
Capital relations are the links between business agents that result from 
shareholdings and from the granting or withholding of credit. Families that 
invest in two or more enterprises, and banks that lend to two or more 
industrial corporations, for example, create capital relations between the 
enterprises involved. The third type of imtercorporate relation, commercial 
relations, involves the trading links that arise through the normal buying and 
selling of goods and services on the mar«xet. 

Each of these types of relation can be sudied at any one of three levels of 
analysis: people, enterprises, or sectors. Where people are taken as the unit of 
analysis between which relations are impated, the research strategy is con- 
cerned with such things as the formation of families and kinship groups and 
other kinds of solidaristic and cohesive groupings. When the focus of atten- 
tion is on enterprises, the researcher will be interested in the ways in which 
intercorporate relations lead to the forma&on of cartels and coalitions. At a 
more complex level of analysis, the unit of analysis may be the sector, when 
particular areas of activity such as product markets or industries are of 
interest. 

Research may thus be concerned with any one of three types of relation 
(personal, capital and commercial) and can involve any one of three units of 
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Table 1 Research strategies for intercorporate networks 


Types of Relation 


Units Personal Capital Commercial 
People 1 2 3 
Enterprises 4 5 6 
Sectors 7 8 9 


analysis (people, enterprises, or sectors). A cross-classification of these two 
dimensions (Table 1) shows the existence of nine distinct research strategies 
in the analysis of intercorporate relations. The unit-of-analysis dimension in 
this typology is closely connected with some of the ideas in Burt's (1980) 
influential typology of concepts of network structure, but the categories of the 
two schemas do not coincide. Examples of each research strategy are given so 
as to provide a framework for the reviews that follow. 

Research on personal relations, then, can take people, enterprises, or 
sectors as its unit of analysis, each being associated with a distinct strategy of 
research. Research using Strategy One would concern the informal in- 
terpersonal relations that arise in and through business activities, and a major 
interest would be to uncover what these linked individuals may have in 
common apart from their shared involvement in a particular enterprise or 
group of enterprises. In research based on Strategy Four, on the other hand, 
the groups of enterprises themselves are central. Research using Strategy 
Seven extends this concern with interpersonal relations to the system level and 
is involved with the ways in which interpersonal connections might contribute 
to the cohesion or segmentation of classes and elites. 

Turning to research on capital relations, we may again see three distinct 
strategies, depending on which unit of analysis is chosen. While Strategy 
Four extends the analysis of kinship and other personal groups from the 
people themselves to the companies with which they are involved, research 
based on Strategy Two extends the analysis into the common shareholdings 
and investments that these people have. Such research would look at the 
overlapping patterns of personal and family shareholdings and lending. With 
Strategy Five, this concern is extended into an investigation of the interest 
groups and financial empires created through aligned investments and bank 
lending. At the sectoral level, research on capital relations approaches one of 
the central themes of the theory of finance capital (Hilferding 1910), the 
extent of the fusion between the financial and industrial sectors to form a 
single system, of finance capital. 

Research on commercial relations with people as its focus (Strategy Three) 
is concerned with individual trading relations in interpersonal settings. Such 
research would study, for example, the day-to-day operations of securities 
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and commodities markets. Commercial relations between enterprises through 
market transactions (Strategy Six) have been central to much economic 
theory, but sociologists have given far lesz attention to the cartels and trusts 
formed in this way. The final strategy of research (Strategy Nine) involves the 
examination of capital relations between sectors of an economy, as, for 
example, in input-output analysis. Few attempts have yet been made to apply 
social network analysis to such data. 

As with any ideal types, these stratezies are not mutually exclusive. 
Particular pieces of research, for examp:e, often draw on various of the 
strategies. Nevertheless, researchers tend -o concentrate on one or the other 
strategy. In the remainder of this paper I show how each of these research 
strategies has been applied in research on corporate power, and I look in turn 
at North American work, European studies, and research on East Asia. 


NORTH AMERICAN RESEARCH 


This section reviews trends in North American research, concentrating on 
developments since the previous reports in the Annual Review (Useem 1980, 
Glasberg & Schwartz 1983), and on research into the American and Canadian 
economies themselves. North American research on other societies is dis- 
cussed in a later section. 

Early concerns with centrality in interlocking director networks have gener- 
ated a thriving research tradition that resulted in a powerful concept of 
financial hegemony. This idea evolved from the work of the Stony Brook 
research group (an important paper of which remains unpublished) (Bearden 
et al 1975). They studied the growth of connections between finance and 
industry, and they discovered the existence of loose regional groupings 
centred around banks. These groupings were, however, tied into an extensive 
and cohesive national network with its fozus in the institutions of the New 
York money market. Reflecting on this research Mintz & Schwartz (1981a,b, 
1983) formulated the idea of financial hegemony to describe the centrality of 
banks in both the national and the regional structure of the economy (1985). 
Their view provides the basis for an art culation between formal network 
ideas and substantive work on capital flows and decision-making, and it is 
closely linked with theoretical work on firance capital undertaken by Zeitlin 
and others. (See the essays in Schwartz _987). 

The shape of the research tradition in North America can best be reviewed 
in terms of the research strategies identifisd in Table 1. An important early 
example of research using Strategy One would be that of Zeitlin and his 
colleagues, who proposed the investigatioa of "kinecon groups.” These kin- 
ship networks form around the ownership and control of particular enter- 
prises. Though seen as applicable to such groups as the Mellons and the 
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Rockefellers (see also Knowles 1973), Zeitlin and his coworkers investigated 
the idea most fully in Chile (Zeitlin et al 1974, Zeitlin & Ratcliff 1989; see 
also Zeitlin & Ratcliff 1975). Such research is very much in the tradition of 
Domhoff's investigations of informal social networks among the capitalist 
class (Domhoff 1967,.1970), and, indeed, Domhoff has undertaken formal 
network ‘investigations himself using latent structure analysis (Bonacich & 
Domhoff 1981). Research using Strategy Four focuses on groups of compa- 
nies formed through intersecting personal relations. Here, cliques and clusters 
formed by means of, e.g. interlocking directorships earn considerable atten- 
tion (Allen 1978, Mintz & Schwartz 1983). Research using Strategy Seven 
looks more generally at issues of class cohesion and the formation of a 
corporate elite and an inner circle (Useem 1984, 1987, Ratcliff 1987). This 
recent conceptualization of multiple directors as an inner circle originated in 
Zeitlin (1974), who formalized the insight of early Congressional inquiries. 

Berkowitz (1975) uses strategy two to study the intersecting partnerships 
formed by the families that made up the "family compacts" of early American 
capitalism. Strategy five research looks at interweaving shareholdings among 
companies: this work has had very little impact in the United States, primarily 
due to the difficulty in getting adequate information on company sharehold- 
ings. This is a result of the limited extent of corporate disclosure legally 
required. The ownership research of Berle & Means (1932) was hampered by 
its reliance on inadequate data (Burch 1972, Zeitlin 1974, 1989). In- 
vestigations of intercorporate networks hampered in the same way have 
generally had to resort to treating interlocking directorships as proxies for the 
unobservable capital relations, using direct information only where this can be 
obtained through reports in the financial press (Sweezy 1939). The systematic 
shareholding data made available for the early 1980s by the Corporate Data 
Exchange has not been exploited by American researchers (See Corporate 
Data Exchange 1980, 1981). Undoubtedly the most exciting developments in 
this area have been the attempts by Mintz & Schwartz to develop research at 
the sectoral level (strategy eight). They have explored the ideas set out by 
writers on finance capital and the integration of banking and industry, and 
they have formulated the concept of "financial hegemony" (Mintz & Schwartz 
1985). As with much earlier research, however, theirs relies on such 
shareholding and banking data as has been reported in the financial press. 

Despite the considerable attention to interpersonal exchange in network 
analysis (Cook ed. 1987), research using Strategy Three has not been well 
applied to business networks. Behavior on the trading floor of tbe stock 
exchange, for example, would (before this was largely superseded by 
electronic share trading) have made a fascinating area of investigation. So far, 
Weber’s: (1894-1896) preliminary explorations on the German stock ex- 
change have not been pursued for any other country, though I understand that 
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some work on stock markets and futures -rading is under way. Despite the 
importance of commercial relations between enterprises in early discussions 
of business power in America (Brandeis 1914), little sociological research has 
used Strategy Six. Economic concentration has, instead, been studied mainly 
in the context of interest groups, using directorship and, to a limited extent, 
shareholding data. Rarely has such research given any sustained attention to 
the commercial relations assumed to be involved in such intercorporate 
groupings. Research using Strategy Nine, to investigate capital relations 
between sectors of the economy, has been limited. Only the works of Pfeffer 
& Salancick (1978) and Burt (1982 & 1983) have begun to explore this 
complex area. 

The mainstream of research on corporate interlocks in North America has 
used Strategy Four; the aim of most researchers has been to document the ties 
between enterprises, to disclose such things as density, cliques, and pattérns 
of relative centrality. This type of research may also be used, however, to 
generate the "dual" (Breiger 1974) of the corporate network—the network of 
personal relations among the people who siz on the boards (Strategy One). An 
example of this approach to personal cliques can be found in the recent work 
of Mintz (Bearden & Mintz 1987, Mintz 1989). 

The early work of the Stony Brook group described an extensive and 
cohesive national network of interlocks ard of capital flows, involving vir- 
tually all the major industrial and financial enterprises. Rejecting theories of 
resource dependence, bank control, and cless cohesion as partial and restrict- 
ed, (Glasberg & Schwartz 1983, but see also Pfeffer 1987), Mintz & 
Schwartz (1985) argue that the model of fimancial hegemony provides a more 
adequate and comprehensive framework. According to this viewpoint, banks 
and insurance companies monopolize the flow of investment capital and, 
therefore, their directors and top executives determine the broad lines of 
corporate development for the whole economy. This power, exercised by the 
inner circle, operates not through direct intervention in the discretionary 
decision-making of corporate boards, bu- through their ability to set the 
parameters of the corporate environment within which all large enterprises 
must act. Financial hegemony operates th-ough a process of structural con- 
straint. 

Canadian research has been inextricably linked with this American research 
tradition. The colonial history of Canada and the continuing impact of foreign 
capital (British and American) has shaped.the development of the Canadian 
corporate network. For this reason, many of the early theoretical debates 
centered around the idea of a satellite economy. This has led to a great deal of 
research into directors and executives orierted primarily toward the Canadian 
domestic economy and to those with a mcre global outlook (Clement 1975, 
Niosi 1978, 1981, Carroll 1982, Brym 1989). The most important recent 
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work to appear on Canadian corporate power is that of Carroll (1986 and 
forthcoming), who documents the consolidation of finance capital and the 
power of Canadian finance capitalists in the Canadian economy during the 
post-war period, despite the strong challenge of American capital. Carroll’s 
work shows the clear similarities in network structure that exist between the 
United States and Canada. In both countries a high level of interlocking is 
mediated through a pattern of bank centrality; the dynamics of these can be 
interpreted as a structure of financial hegemony. 

Research on intercorporate networks has spawned a number of related 
studies, as investigators have attempted to show the connections between 
personal, commercial, and capital relations on the one hand, and wider social 
processes on the other. One study concerns the link between interlocks and 
corporate political donations (Koenig & Gogel 1981). Burt (1983) and 
Richardson (1987) have related corporate networks to profitability, while 
Glasberg (1981, 1987a,b) looked at the relationship between intercorporate 
relations and business crises. Ratcliff (1987) explored the link between 
interlocks, bank lending, and urban development. This work was recently 
extended by Palmer et al (1990), who connect it with the community power 
debate (Friedland & Palmer 1984). This has raised crucial theoretical issues 
concerning the formation of "collective actors" through intercorporate rela- 
tions and the nature of network constraints (Perucci & Potter 1989). 


EUROPEAN RESEARCH ON INTERCORPORATE 
RELATIONS 


Despite some early work which stimulated the ideas of.Hilferding and Lenin, 
research in Europe has been slower to develop than in the United States. A 
turning point was the collaborative ten-nation study carried out, under the 
influence of American research, by Stokman and his colleagues (Stokman et 
al 1985b, Scott 1987). This research was undertaken on a systematic and 
comparative basis and brought some of the leading American researchers 
together with those in Europe. It showed the important variations in network 
structure that exist within Europe and between continental Europe and the 
United States. While corporate power in the British economy can be un- 
derstood along broadly the same lines as that in the United States and Canada, 
patterns found in other European countries differ sharply from these and 
substantially from one another. 

Scott (1985, 19902) has argued that the situation in Britain falls within an 
Anglo-American system of capital mobilization, found variously in Britain, 
the United States, Canada, Australia, New Zealand, and South Africa. In all 
these economies a shift has occurred from personal forms of ownership to 
more impersonal systems of control. Institutional shareholdings have made a 
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pattern of financial hegemony the domirant feature of each economy. In 
Britain, for example, the majority of large enterprises are tied together into an 
extensive and cohesive network of personal, capital, and commercial relations 
focussed on the activities of the inner circe of finance capitalists who sit on 
the boards of the major banks (Scott & Criff 1984, 1985). Analysis of the 
network of interlocking directorships showed the existence of a number of 
loose bank-centered spheres of influence ir: which the executives of the major 
clients and customers of the leading banks were coopted to the various bank 
boards. These bank-centered groupings were not based on strong and recip- 
rocal capital relations. Each major enterprise relied on two or more banks for 
short-term loan capital, and they used the services of various merchant banks 
to obtain access to further capital. As a -esult, large blocks of their share 
capital were held by diverse and overlapoing sets of financial institutions. 
Large enterprises with dispersed share capital were, as a result, subject to 
control through a constellation of interests (Scott 1986, 1990). 

With certain variations, resulting from varying levels of foreign penetration 
and variations in the strength and survival of family ownership, this combina- 
tion of control through a constellation of interests and financial hegemony 
prevails in all the Anglo-American econorries. This similarity reflects similar 
patterns of business development, whici reflect a common cultural in- 
heritance and a uniformity of legal pract.ces. The countries of continental 
Europe, however, differ markedly from th s pattern. This divergence reflects 
a cultural and legal divergence and patterns of late industrialisation, often 
under the aegis of state sponsorship. 

German industrialization, for example, gave a key role to the big banks, 
which took substantial stakes in the uncertakings that they nurtured into 
industrial giants. This was first reported in Jeidels (1905) and became the 
basis of the arguments in Hilferding (1910) and Lenin (1917). The partition of 
Germany in 1945 weakened the financial power of the banks, but they 
remained a powerful force in the economy Reunification of the two German- 
ies might be expected to consolidate the power of the banks.by giving them, 
once again, considerable leverage over the economy of the eastern part of the 
country. Ziegler's research focused on interlocking directorships, using Strat- 
egy Four of Table 1 (Ziegler et al 1985z,b. See also Spohn & Bodemann 
1989). This research showed that both Germany and Austria had corporate 
interlock networks in which the position o? the banks was far stronger than in 
either Britain or the United States. The limited available evidence on in- 
tercorporate shareholdings suggests that business organization in Germany 
and Austria, and to a lesser extent in the Netherlands (Stokman et al 1985a) 
and Switzerland (Rusterholz 1985), invclves a pattern of oligarchic bank 
hegemony rather than a generalized financial hegemony. In these economies 
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far more than in the Anglo-American world the banks are likely to intervene 
in industrial decision-making. Some evidence indicates that the situation in 
the Scandinavian economies of Norway and Sweden is similar, though such 
research has been limited to interlocking directorships (see, for example, 
Ottosson 1989). 

In France (Swartz 1985, Marceau 1989), Belgium (Cuyvers & Meeusen 
1985), and Italy (Chiesi 1985, Martinelli & Chiesi 1989), by contrast, 
investment holding companies played a central role, and the French and 
Italian economies have had a significant level of state involvement in their 
economies during the post-war period. This pattern was well-established 
during the nineteenth century, with corporate finance coming from investment 
groups rather than from the stock exchange. Those companies that sought to 
expand beyond the resources of their original family controllers were, because 
of the absence of strong and developed stock exchanges, forced to align 
themselves with a bank or holding company that could guarantee them the 
capital they required. The banks did not, however, achieve the kind of 
dominance found in Germany, because of differences in the organization of 
the banking systems. The banking systems of these countries were divided 
into separate deposit and investment sectors, and the French, Belgian, and 
Italian banks could not play the samé role as the German universal banks. The 
“holding system" allowed the deposit banks to become involved in industry 
without baving to tie their deposit funds into long-term investments. The 
separate investment holding companies provided long-term capital and took 
substantial shareholdings. Around each of the main holding companies are 
large and cohesive groupings of financial and industrial enterprises. The 
enterprises within each retain a degree of autonomy, but they are subject to a 
loose coordination with others within their group. The boards of the holding 
companies are the centers of coordination. 

An examination of networks of corporate power in Europe shows that the 
models and concepts developed in North America cannot be applied without 
qualification outside the Anglo-American cultural sphere. Bank control and 
financial groups—epitomised, respectively, by the German and French sys- 
tems—are forms of impersonal capital mobilization which diverge from the 
pattern of control through a constellation of interests and generalized financial 
hegemony found in Britain and the United States. It might be expected that 
this divergence would be even more marked in economies that lie completely 
outside the European or Western sphere. Evidence on this is limited, but a 
substantial body of research exists for the economies of the Far East, and 
particularly for Japan. A consideration of this research, at the frontiers of 
current work, will help in constructing explanations of the various patterns 
described. 
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THE DEVELOPMENT OF RESEARCH ON ASIA 


Research on business groupings by Japanese scholars has a long history, 
though it has not, until very recently, inolved the explicit use of network 
concepts. The two key figures in modern Japanese studies of intercorporate 
shareholdings have been Futatsugi (1978, 1986) and Okumura (1979, 1984), 
both of whom have worked within Strategy Five of Table 1. Futatsugi has 
demonstrated clearly the inadequacy of atomistic models of the firm. He 
shows that it is impossible to identify a sirgle locus of control in the majority 
of the largest Japanese enterprises, the firrrs are elements in extensive webs of 
connections. Japanese capital relations are characterized by a reciprocity and 
circularity in shareholdings between' enterprises. As a result, to identify any 
seat of ultimate control is impossible. Alttough Futatsugi underestimated the 
extent of such circularity and interlocking :n Britain and the Uni:ed States, he 
correctly identified the high degree of reciprocity found in Japan and the ways 
in which these shareholdings are aligned tc form tightly knit enterprise groups 
or kigyoshudan. Okumura has elaborated on this perspective by contrasting 
the interweaving of institutional holdings in Britain and the United States with 
the interweaving of corporate holdings im Japan. While there are extensive 
bank and insurance company holdings in -apan, these companies hold shares 
on their own account rather than as investment trustees, and many of the 
reciprocal holdings are those of industrial corporations themselves. Okumura 
(1984) terms this a system of corporate xapital. 

The kigyoshudan of modern Japan are groups of enterprises tied together 
through reciprocal shareholdings, credit relations, and interlocking di- 
rectorships. The members of the group are rarely subsidiaries of one another; 
links of majority and minority control ace unusual among the large enter- 
prises. Instead, each enterprise within a group is controlled by coalitions of 
aligned participations. Each group member participates in the control of each 
of the other members. As a result, the group has a stable structure and is 
relatively immune from influence by ourside interests. Corporate power is 
maximized within each group and minimized between groups (Scott 1986: 
Chapter 7). 

Where interweaving shareholdings in the Anglo-American system resulted 
in an extensive network within which th> largest enterprises are controlled 
through constellations of interests, reciprocating shareholdings in Japan create 
a cellular network of intensively integrated groups. This cellular structure is 
combined with extensive, but looser, connections between groups. Older 
groups based in pre-war zaibatsu, such as Mitsui, Mitsubishi, and Sumitomo, 
maintain few links outside their own groups, but the other groups are linked, 
albeit loosely, into a wider network (Scott 1986:186—191). 

The work of Ueda (1986, 1989, 1990) has extended this tradition of 
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analysis from capital relations to personal relations, moving from Strategy 
Five to Strategy Four of Table 1. Ueda has also made explicit use of the 
techniques of social network analysis to further his research. His study 
concerned all quoted companies and major firms of 1983, and almost half 
were found to be connected into a single large component in the interlock 
network. The big six enterprise groups each had high levels of internal 
interlocking, but they were also tied together through intergroup interlocks. 
Despite the lower density of the Japanese network compared with that of the 
United States, Ueda indicates that some evidence suggests an inner circle 
(Useem 1984) in Japan. There is, he claims, a dominant segment of the 
capitalist class able to transcend the parochial interests of the separate enter- 
prise groups. This inner circle is not constituted solely of the multiple 
directors in the interlock network. The relatively small number of multiple 
directors with extensive business interests are joined in the Japanese inner 
circle by the leading figures of the separate groups who sit on key business 
organizations such as the keidanren. 

Numazaki (1986, 1988) has used similar methods in his study of business 
networks in Taiwan. He shows that interlocking directorships connect sepa- 
rate enterprise groups and that it is possible to identify an inner circle of the 
Taiwanese capitalist class. The interlock network in Taiwan includes a major- 
ity of the large companies, and there is a tendency towards bank centrality. 
The most central directors in this network are drawn from the xiejinhui, the 
central business federation, and Numazaki also shows that the multiple 
directors were tied together through marriage. Ueda, Numazaki, and others 
are now working together on an East Asian replication of Levine's (1985) 
atlas of corporate interlocks. 

The recent work of Hamilton and his colleagues (Hamilton et al 1987, 
forthcoming; Hamilton & Biggart 1988), which compares Japan with other 
East Asian societies, has been extremely influential in both the United States 
and Japan. Their work consolidated the move from descriptive to explanatory 
concerns in research, providing a framework in which the above findings can 
be understood. The debate over this work has shown very clearly that 
historical and cultural variations affect the form taken by impersonal posses- 
sion and that the Anglo American model does not apply in East Asia. The case 
is made for bringing network analysis into the mainstream of.historical and 
comparative sociology, showing how such factors as the structure of the state, 
cultural traditions, and processes of industrialization have shaped present-day 
business structures. 

Hamilton and his associates look at Japan, South Korea, and Taiwan and 
are concerned to understand the formation of enterprise groups—clusters of 
enterprises associated with one another through capital and personal relations. 
In this way, they successfully integrated Strategies Four and Five of Table 1. 
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Enterprise groups rather than individual enterprises, they argue, are the most 
significant actors in all three economies. They show that, within each society, 
enterprise groups have common characte-istics, though the kind of group 
formed varies considerably from one society to another. These variations 
reflect the particular constraints under which enterprises operate. These con- 
straints—the opportunities and limits withan which enterprises operate—are 
the products of the multiple social institutiens of each society (Hamilton et al 
1987:101), and especially of their state and of the culturally grounded norma- 
tive framework of economic action. 

They move from an explicitly interorgarizational approach to questions of 
corporate power, and in Japan they idemify the familiar kigyoshudan and 
keiretsu types of enterprise group. Entecprises can be vertically aligned 
through majority and minority holdings to form Keiretsu, and some enterprises 
may also be horizontally aligned througk reciprocal shareholdings. These 
reciprocal horizontal shareholdings tie keiretsu together into kigyoshudan. 
The latter identify themselves as commun ties of firms united through com- 
mon concerns and interests, tied to the sarne banks and insurance companies 
and coordinated through their President's committees. The Japanese econo- 
my, they argue, is dominated by six big kizyoshudan and ten smaller groups, 
the latter being more weakly integrated ard more dominated by the vertical 
keiretsu principle of organization. (See aBo Scott 1986.) 

In South Korea, on the other hand, the major economic actors are 50 
chaebol, industry-specific vertical alignmeats generally owned and controlled 
by individuals and families, though influenced by the state through its control 
of the banking system. The personal investments of the controlling families 
supplement vertical corporate holdings in these pyramidal structures, and the 
groups operate with a unified management structure. Interlocking di- 
rectorships among the enterprises making up a chaebol are rare, though 
family control is secured through the placing of family members on the 
various boards. In Taiwanese business, vertical relations are virtually absent 
in the jituangiye which dominate the economy. These groups are loosely 
integrated horizontal alignments linked through overlapping personal hold- 
ings and lacking any unified management structure. 

Hamilton et al stress the structure and role of the state and the nature of the 
kinship system as the crucial explanatory factors. South Korea, they argue, 
has a strong state, actively involved in the public and private spheres of the 
economy, while Taiwan has a strong society and a weak state. The political 
economy of Japan, by contrast, shows an intermediate power relation, where- 
by the state encourages the formation o^ large and independent business 
groups as the basis of an administrative regulation of the economy (Hamilton 
& Biggart 1988:76—81). Divergent state strategies reflect varying responses to 
war and occupation and subsequent attempts to establish and legitimize 
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systems of rule which draw on specific cultural traditions. The South Korean 
government drew on the imagery of the strong Confucian state, the Japanese 
authorities on the Tokugawa and Meiji practices of the central coordination of 
independent powers, and the Taiwanese employed the notion of a benevolent 
state. The success of the three legitimation strategies depended on the suppor- 
tive framework offered by the structure of family property in each society. 
(See the more generalized arguments in Hamilton 1984, 1990). 

The arguments of Hamilton and his colleagues accord with a wider body of 
research on kinship and business, and it is worth elaborating on these wider 
arguments. Business enterprise in Japan before World War II was pre- 
dominantly family enterprise, and many of its characteristic features derived 
from the traditional Japanese kinship system ‘as it existed among wealthy 
merchant and samurai families. The traditional Japanese family corresponds 
to what Le Play (1871) called the stem family and which Todd (1985) has 
recently described as the authoritarian family. Traditionally termed the ie, 
this was a corporate group subject to the authority of a male head who had 
power over family members in all spheres of life. The corporate family 
perpetuated itself as an entity from one generation to the next, and its 
headship passed through inheritance to a new head. The strong vertical link 
between the head and his sons was the central organizing principle of the 
family, as the head was responsible for managing the family property and for 
ensuring that the patrimony was passed on intact to the next generation (Befu 
1971). 

Only one son would be able to inherit the family property, and so 
noninheriting sons, generally the younger ones, would have to leave the stem 
family to form their own brancli families, though they would remain subject 
to the authority of the head of the senior line. The main line of the family 
dominated the subordinate branch lines. Whenever possible, the head of the 
main line would attempt to set up the branch families of his younger sons with 
some land and property, to which the branch family could add through its own 
efforts. The branch family would, in the case of merchant families, frequently 
operate as a business affiliate of the main family. 

Japanese industrialization centered around large-scale family enterprises, 
especially important strides being taken by merchant and samurai families 
which had strong connections with the state. These enterprises were shaped 
by the practices of tbe traditional ie and dozoku (Clark 1979, Hirschmeier & 
Yui 1975). The legal forms of the partnership and the joint stock company 
were ideally geared to many of these family enterprises. They allowed the 
main and branch families to take varying shares in the family enterprise, while 
also maintaining the unity of the business. Many of the larger enterprises, 
those operating in a number of industries, established central organizing 
agencies to maintain tight family control. An omotokata or honsha would 
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function as head office for each group, regulating the family property and 
allocating income to family members (Fruin 1983). Such bodies would 
sometimes take the form of a holding company which controlled the operating 
branches of the business. 

Many of the large and successful family enterprises which owned their own 
banks were encouraged by the state to channel bank funds into strategic 
industries. The holding companies coordimated their vast and sprawling in- 
terests through a pyramidal structure of majority and minority sharehold- 
ings—the business form characteristic of the zaibatsu. Such centralized con- 
trol of large enterprises would not have bezn possible for any one branch of 
the family alone. On this basis, and through the use of the banking mech- 
anism, the zaibatsu could take advantage >f the opportunities opened up by 
the industrial policy of the Japanese state in.the early twentieth century. While 
all Japanese family enterprises, to varying degrees, embodied the same 
kinship structure, the zaibatsu groupings arose only where entrepreneurs were 
able to rely on the sponsorship and support Df the government in building their 
groups. Thus, the formation of group enterprise in pre-war Japan must also be 
explained by the particular role played by the state. 

The Chinese kinship system differed comsiderably from that of Japan. The 
traditional Chinese family system was not the authoritarian stem family, but 
the patriarchal or “exogamous community ' family. In this structure all sons 
have an equal right to inherit from the family property on the death of the 
father, creating a tendency toward fragmentation of family holdings through 
subdivision. In the Chinese family, all sonscontinued to live in the patriarchal 
household after their marriage, and the fatlrer had authority over all who lived 
under his roof. On the death of the father, each son could claim his inheritance 
and use this to set up a new household. of which he would become the 
patriarchal head (Todd 1985). This form of family shaped Chinese business 
activities in Shanghai, Hong Kong, Taiwan, and in Chinese expatriate com- 
munities across the world. 

Wong (1985) has shown, for example, that firms in Hong Kong went 
through a cyclical process. They were often formed by partnerships of 
brothers, reflecting the fraternal solidarity of the Chinese family. Brothers 
formed themselves into overlapping and cooperative partnerships which, with 

, Success, eventually became fused into a single extended family enterprise in 
which one of the brothers took the role of head of the family. The head 
exercised authority as trustee for the interests of the whole family and for the 
shares of its various members in the busiress, adopting a patriarchal role of 
personalized business leadership. After th» death of the head, the enterprise 
fragmented as the sons, and their own sons, struggled to secure their rightful 
shares in the venture. Centralized firms gradually reverted to the overlapping 
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partnerships of brothers. And so the cycle proceeded through new genera- 
tions. 

Compared with Japan, therefore, Chinese businesses were less durable and 
more short-lived. There was no structural basis for unity and continuity over 
the generations. The authority of the patriarch was simply a lifetime phenom- 
enon, and no over-riding authority was vested in a senior lineage. 

Numazaki (1986, 1987, 1991 forthcoming) has shown how business enter- 
prise in Taiwan shows this same pattern. Family enterprise in Taiwan was 
loose and unstable, and industrial expansion was limited by the fragmentation 
cycle inherent in the Chinese family firm. Expansion has also been inhibited 
by the difficulty of joint stock company formation and the weakness of the 
stock exchange mechanism, which precluded the establishment of large-scale 
intercorporate holdings. The Chinese family form did not encourage the 
establishment of a coordinating honsha, as there was no structural basis for 
keeping the family patrimony intact. For these reasons, zaibatsu-type group- 
ings could not emerge in Taiwan, even if the state had attempted to encourage 
and sponsor them. 

The Taiwanese economy developed a dualistic structure under Japanese 
colonial rule. At the top of the corporate hierarchy were large firms run 
directly by the Japanese zaibatsu, and below them were smaller indigenous 
enterprises run along Chinese lines. The Nationalist government maintained 
this dualistic structure when it nationalized the Japanese assets, creating a 
dualism between large state enterprises and smaller private enterprises. This 
began to change with privatization of state assets in the 1950s and 1960s. 
Firms were acquired by Taiwanese families and therefore adapted to the 
existing pattern of family enterprise in private industry and land. The recent 
expansion of business activity in Taiwan has been possible only through 
extending the overlapping partnership principle to nonkin groups, a particu- 
larly unstable form of group enterprise. 

Numazaki shows that group enterprise exists in present day Taiwan because 
of overlapping personal investments and not because of intercorporate in- 
vestments. That is, he is showing the continuing importance of the kind of 
work carried out using Strategy Two of Table 1. Companies are linked 
together because related or associated individuals and families invest in them 
and cause them to engage in reciprocal commercial transactions. They form 
horizontal groupings of limited extent. As Wong (1985) and Greenhalgh 
^ (1984) show, the business network structure of the 1950s and 1960s was 
atomistic or cellular in nature. This differs from the form of group enterprise 
found in the Japanese zaibatsu, where personal investments were hierarchical- 
ly structured (though with some intercorporate holdings) into separate, seg- 
mental enterprise groups with a vertical organization. Numazaki also shows 
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that none of the Taiwanese family groups correspond to the branch/stem 
grouping found in Japan. Instead, there a-e alliances with the immediate 
family (father with son, brother with brotner) or between two immediate 
families (by descent or by marriage). These patterns, argues Numazaki, are 
essentially the same as the kegu or lianhao partnerships of the traditional 
merchants of China. 

The absence of intercorporate holdings in Taiwan reflects the low propor- 
tion of companies that are publicly traded cn the stock exchange: The legal 
and financial structure does not facilitate intercorporate shareholdings. Never- 
theless, Numazaki does show that the varicus groups have developed close 
linkages through investment and directorships—they are embedded in a wider 
economic network of personal, commercml, and capital relations. This, 
again, contrasts with the hermetically sealed and isolated zaibatsu of pre-war 
Japan. Network formation appears to be a paenomenon of the 1960s, related 
to the rapid industrialization and expansion which began then. For this reason, 
the most appropriate structures to compare with the Taiwanese group enter- 
prise might be with the post-war Japanese groups. 

It is possible to speculate a little on the siuation in Korea, for which more 
limited information is available (Lee 1987). Here, the traditional kinship 
system is closer to the Japanese pattern than it is to the Chinese. Its 
characteristic form of group enterprise, the chaebol, is structured in some 
respects like the pre-war Japanese zaibatsu. The chaebol are narrower in the 
range of industrial sectors in which they opecate, do not own their own banks 
(they rely on state banks), and have a centralized management structure 
similar to that of the keiretsu. These have emerged, argues Lee (1987), in 
response to recent rapid industrialization, as business leaders sought the 
security of doing business with those they krew and trusted. No equivalent of 
the honsha exists to service the stem family. Instead, families hold assets 
through direct investments and through securities companies. The dynastic 
structure of the chaebol has meant that they have developed few in- 
tercorporate linkages. The economy has a cellular structure with no overarch- 
ing network of the kind found in Taiwan. 

Business enterprise in pre-war Japan and in post-war Tian shows varia- 
tions in structure related to forms of kinship and the state. The kigyoshudan 
which developed in post-war Japan were in fact already beginning to appear in 
the pre-war period (Tominomori, 1979-1980, pp. 25-27), but depended . 
crucially on post-war zaibatsu dissolution rHadley 1970). ` 

Futatsugi (1978; and see Ueda 1987, and Gerlach, forthcoming) shows that 
the post-war period saw a decline in persomal share ownership and a corre- 
sponding increase in corporate (rather than merely finanical) holdings. This is 
the basis of the argument of Okumura (1984) discussed earlier. New invest- 
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ment capital comes predominantly through the banking system, and the 
primary purpose of shareholding relations in Japan is to maintain control. 
There is, therefore, little incentive for personal investment in companies. 
Added to this was a positive desire on the part of the controllers of industrial 
enterprises to acquire shares in those companies with which they did business. 
Group enterprise was seen by Japanese business leaders as an appropriate 
strategy to pursue. This differentiated them from their British and American 
counterparts, who formed vertical links of a parent-subsidiary type, but not 
horizontal links of a kigyoshudan type. 


THE FUTURE OF RESEARCH 


Ihave tried to show how research on networks of corporate power has moved 
away from formal and highly generalized models of network structure toward 
an awareness of the cultural and historical embeddedness of personal, capital, 
and commercial relations in business. This was particularly apparent when 
research from outside the Anglo-American sphere was considered. Research- 
ers in Europe and Japan, and those American researchers who investigated 
these areas, had the immense advantage of being able to build on the already 
established and thriving North American research tradition on networks of 
corporate power. Í 

The discussion of Asian networks of corporate power, for example, showed 
the importance of the state and of kinship as crucial differentiating factors, 
operating in the context of varying culturally grounded experiences of in- 
dustrialization. The research on corporate networks and forms of group 
enterprise, therefore, relates to the debates around the relevance of cultural 
factors to economic development. The arguments of Dore (1973) and 
Morishima (1982), for example, suggest that the Confucian inheritance of 
Japan (distinct from that of China) was important in creating a group orienta- 
tion among business leaders. This suggests that an important area of future 
investigation might be the impact of cultural differences within Europe. 
Equally, there is a need for more research on the cultural and historical factors 
responsible for shaping the development of networks of corporate power in 
North America. 

These developments in comparative research strategy suggest new ways of 
approaching problems which have, hitherto, been regarded as closed debates. 
Of particular importance are the debates over bank control, management 
control, and the fate of the capitalist class. The debate over bank control, in 
relation to the American economy, seems to have run its course, with the 
notion of bank control being firmly rejected in favor of a model of financial 
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hegemony. (See the arguments of Fitch & Oppenheimer and the en- 
suing debate reprinted in Scott 1990b). Our knowledge of the German case 
reopens the question of the relevance of bank control to certain European 
economies, and there is also a need to explore the ways in which both this 
concept and the concept of financial hegemcny need to be modified in order 
to grasp the particularities of the form cf business enterprise found in 
Japan. l 

Similarly, the debate over the management control can be reinvigorated by 
a thorough-going concern with comparative esearch. The analysis of owner- 
ship and control has, traditionally, been focussed on the control situation of 
individual enterprises. The work of Zeitlin (1974, 1989) pointed to the 
inadequacies of this strategy, and it should not be forgotten, of course, that 
Zeitlin’s critique emerged from his invest.gation of patterns of corporate 
control in two rather different societies—the United States and Chile (Zeitlin 
et al 1974, Zeitlin & Ratcliff 1988). This wcrk highlighted the importance of 
interweaving intercorporate relations and laid the basis for the formulation of 
the concept of control through a constellation of interests (Scott 1990a). This 
concept places intercorporate relations at the zenter of the research agenda and 
points to the need to investigate the impact of different patterns of in- 
tercorporate relations for the control of particular enterprises within an eco- 
nomy. 

The demise of the capitalist class was confidently announced by Bell 
(1961), and many advocates of managericlism followed his lead in this. 
Research on networks of corporate power, om the other hand, has avoided this 
issue by referring simply to the corporate elite. The more scphisticated 
contributors in this area (Useem 1980, Schwartz 1987) have approached the 
issue of the capitalist class but have tended to equate corporate elite and 
capitalist class in a rather unreflective way There is a need to explore the 
class position of those who are recruited to the corporate elite and who are 
drawn from families with diversified, system-wide portfolios of wealth in 
numerous units of capital. Despite their lack of personal, entrepreneurial 
control, they have a direct supervisory position in the affairs of the very 
businesses on which their privileges and advantages depend. There seems to 
have been a partial dissociation of the mechanisms of capital reproduction 
from the mechanisms of class reproduction. Capital reproduction takes place 
in systems of impersonal possession, of the Anglo-American and other types. 
The reproduction of a capitalist class has ccme increasingly to involve other 
social mechanisms, such as their ability to wilize informal networks of social 
connection. Only recently has a collection of comparative papers (Bottomore 
& Brym 1989, see also Scott 1991) pointed the way ahead for research in this 
area. There is a great research agenda for 1he future. 
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Abstract 


The authors, by reviewing the recent work of several behavioral and social 
science disciplines concerned with the impact of war upon society, suggest 
that the study of war's social ramifications has been characterized by the 
publication of empirical work that neglects, to some extent, the larger socio- 
logical implications of war as well as its ability to reorder society. The work 
published over the last two decades and reviewed here was written in the 
United States in the wake of the Vietnam conflict. This work has examined 
the implications of military manpower recruitment and training, as well as the 
psychological and economic implications of wartime service. The immediacy 
of this conflict, however, may have deterred sociological analysts from 
systematically examining the macroscopic implications of social change, a 
void that is remedied, in part, by historical and literary arialyses that consider 
the long-range impacts of past wars on their. belligerent societies. Finally, the 
authors propose that the study of the life course may offer one-avenue of 
inquiry that can connect the micro- and macro- levels of analysis; thus 
connecting the soldier’s story to that of his changing society. 


INTRODUCTION 


Wars and their effects. are a continuing, permanent part of the personal and 
political landscapes that paradoxically: demand incorporation and defy ready 
integration into an acceptable world view. The warrior's image, his wounds, 
and the world he comes home to are a story that‘ has-been. retold in Western 
culture at least from the time of Homer. The lessons of the Odyssey are the 
same as those of late-twentieth-century social science—wars may end, but 
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they continue to reverberate in the lives of those who fought them and within 
the soldiers’ societies. The study of the effects of war, in the most useful and 
powerful accounts we have reviewed, is a dialectical process through which 
war becomes a touchstone of inquiry and -esponse. 

As a repeated, complex social event that profoundly affects lives and 
institutions, war might appropriately claim a solid place in contemporary 
sociological discussion. If our exploration of literature treating the effects of 
war has led us to any firm conclusion, however, it is that the subject is vast 
and ramified, and that its discrete elements cannot easily be separated from 
consideration of change in the society as a whole. Thus, this review has 
several differences from the typical overview of a field of research. We have 
found that the approach of categorizing the literature into different schools of 
research misses the main dynamic behind -he development of inquiry. It is 
more helpful to think of modern war as a. process through which (even in 
victory) societies reorder themselves, both in opposition to an outside enemy 
and internally, in accordance with the needs of the society momentarily 
unified (or nominally so) for the duration of hostilities. 

Rather than ignoring the political implications of the study of changes 
engendered by war, we emphasize the powerful relevance of the subject to an 
ongoing public discourse, where much of this literature originates. Rather 
than focusing narrowly on the work that sociologists have completed, our 
essay explores the research on war’s impact in a variety of social and 
behavioral sciences over the last 20 years and attempts to draw these together 
in a synthesizing perspective appropriate to sociology. 

Because of our focus on recent scholarsh-p, we slight the kind of analytical 
approach that focuses attention on remote antecedents. Instead, we discuss 
different ways in which recent American (and, occasionally, other) social 
science has drawn on the experience of the two World Wars and of Vietnam, 
and of the large peacetime military that the United States has maintained since 
World War II. Americans’ sharpened interest in the impact of war originated, 
we think, in a widespread need to come to -erms with the Vietnam conflict — 
to construct through social science no less than through literature and political 
discussion an agreed upon meaning for the war. Such meanings have neces- 
sarily been created slowly, most often first through examination of the 
continuing social and psychological careers of those who fought and survived 
it. The “hidden wounds" (Hendin & Haas 1984) playing out in the lives of 
these men have been a central motif in the dialogue over Vietnam, including 
that within social and behavioral science. 

We begin our exposition by drawing tcgether three apparently disparate 
incidents in the intellectual history of the United States, dating from the late 
1970s and early 1980s. We first discuss the initial, and highly politicized, 
attempt on the part of institutional social science to assess the lasting impact 
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of Vietnam for American society: Legacies of Vietnam, a curiously nonuni- 
form and incoherent series published as a government document. We then 
examine the emergence of the subject of Agent Orange, an illustration of the 
politically and culturally imperative need to assess war's impact. Finally, we 
explore some of the ways in which the theoretical and emotional distance 
granted by the passage of time has permitted historical scholars to analyze the 
dissonance between the destructiveness of modern warfare and the massive 
social organization that its conduct requires—using the Great War as their 
example. 

These three interrelated expositions indicate how the language and the 
analytic techniques for assessing war's impact are evolving only with difficul- 
ty. The problem is akin to that eloquently explored by the literary critic Paul 
Fussell (1975, 1989): The state, in waging war, must create and disseminate 
stories that justify and ennoble the cause, but such stories deny the soldiers' 
experience of war as a theater of error. The disjunction, Fussell indicates 
through literary analysis, takes time and creative imaginative effort to over- 
come, before the soldier's war and the nation's war are meaningfully con- 
nected. The burden of our essay is to show how the urgent, troubled move 
from the soldier's story to society's story, as reflected in social science over 
the past two decades, has been bound up with the need to incorporate and 
eventually integrate micro- and macro-level accounts, to bring together war as 
it has played itself out in the postwar lives of participants at various levels and 
in the institutional and ideological structures erected in order to bring these 
men to battle. 

Legacies of Vietnam consists of four detailed substantive volumes, initiated 
as social science's conscientious response to the personal and political an- 
guish that festered in the years following the war's conclusion (Rothbart et al 
1981, Laufer et al 1981, Kadushin et al 1981, Egendorf 1981). Each sub- 
stantive volume was written by a different team of researchers but used a 
common, complexly drawn, stratified sample that included men who served 
in Vietnam, who served in the military elsewhere, or who were not called into 
service. A coalition of Vietnam veterans' self-help groups had initiated the 
project, which for some time was carried out by a volunteer group of 
mental-health specialists and social scientists, before the National Institute of 
Mental Health and, finally, the Veterans Administration (acting in response to 
Congressional mandate) funded the project. Clinical psychology, social psy- 
chology, and sociology provided analytic frameworks to the separate teams, 
each of which subsequently published extensively upon the data (several 
directing their efforts toward general audiences). No effort was made, howev- 
er, to draw up common measures, to address theoretical (or political) agen- 
das, or to integrate the analyses upon their completion. 

The Legacies volumes and the authors’ subsequent materials treat many of 
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the key issues raised in the literature on the social impact of war: the fairness 
of the paths leading individuals to combat roles, the disruption of the life 
course by military service, the nature of combat and its psychological con- 
sequences, and the social structuring of reabsorption into the civilian world. 
Its parts, taken together, however, fail to offer a coherent argument. Although 
Legacies attained more than minimum technical adequacy, it was published 
by the Government Printing Office most Lnattractively and seems to have 
been accorded no reviews in professional jcurnals. Despite the powerful and 
explicitly felt imperative on the part of the project's participants to evolve a 
meaning for the experience of Vietnam tha. would link soldier's tales to the 
nation's, micro-level analysis in Legacies dominates macro, even in the 
works several of the teams later published for more popular audiences. 

The differing analyses surrounding the Agent Orange controversy can 
illustrate the imperative nature of assessing war's impact and suggest some 
limits that urgency can impose. American soldiers in Indochina were exposed 
to defoliants sent not by the enemy but by their own side, with the effects 
always dormant but perplexingly various. As the issue surfaced and spread in 
the late 1970s, veterans’ claims of harm to self or offspring could not be 
scientifically validated (Blackburn 1983), but the manifest strength of an 
association between perceived exposure amd subsequent acute distress has 
kept the issue alive (US Centers for Disease Control 1988 p. 2711). 

On the macroscopic level, sociological analysis indicates that Agent 
Orange has focused the malaise of numbers of veterans, providing means for 
the expression of their current sense of no: having been received back into 
communion with American society. Hall (1989) and Scott (1988) describe the 
social evolution of the Agent Orange issuc, emphasizing how, in complex 
interaction, a penumbra of meanings surrounds the issue's evolution in veter- 
ans’ groups, in the various agencies of government, and in the press. Tesh 
(1988) suggests how Agent Orange offer: veterans (and their supporters) 
meaning and cause for their pain in the reaCjustment process and allows their 
problems to be ascribed to an agent, thereby conveying a meaning for their 
suffering that bears a stamp of rationality amd science. Agent Orange poison- 
ing has become an objective correlative of ihe hidden wound—war’s impact 
emerging only after years, its analysis necessarily longitudinal. 

World War I has offered historians a field on which to analyze this process. 
Leed (1979) thus explores the interaction between German cultural categories 
and German soldiers’ experience of the Great War, concluding that the effect 
of combat was a disintegrative personal experience which rendered the self 
intuitively meaningless. Going into the waz, German culture comprehended 
battle as a test of self that could resolv2 the pressing contradictions of 
modernity. Instead, the harsh reality of battke marginalized its participants, its 
trenches ironically objectifying the very labyrinth that the experience of 
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modernity had come to seem. Whalen (1984), similarly, responding to con- 
temporary concerns about the impact of war while analyzing the careers of 
German veterans of the World War I, views the imagery of heroism and its 
negation in the context of the rise of a medical model of the treatment of 
disability. Medical treatment, he indicates, acted as a way station, separating 
the wounded, the war's most visible reminder of defeat, into a new anesthetic 
world located in hospital and recuperative environments. As such, the mean- 
ings emerging from fighting and defeat were put to sleep—but not settled, as 
1933 would reveal. 

While seeking to synthesize an essentially sociological perspective, we 
believe that the social effects of war— "hidden wounds"—become apparent at 
different moments to different disciplines, each with their own timeframes, 
analytical methods, and separate investigative questions. We therefore draw 
freely upon a variety of literatures. Our survey will emphasize fields of study 
that incorporate a longitudinal framework: institutional manpower analyses of 
military service; still more focused accounts of socioeconomic changes in 
individuals attributable to military service; and examinations of the health 
consequences of war service. In each of these, we aim to place the individual- 
level study in the context of the macroscopic work that has been done. These 
several lines of inquiry finally converge upon life course analysis, which 
serves to relate micro- and macro-levels of analysis. This, in turn, connects 
with a macroscopic sense of war's impact that centrally informs our synthesis, 
that the social effects of war are neither immediately apparent nor discrete. If 
war and military service sufficiently strain the existing social fabric, the 
changes engendered by war may rework existing social categories and refocus 
social change. 


WARTIME SERVICE, SOCIAL STRUCTURE, AND 
SOCIAL CHANGE 


The ambiguous struggle in Vietnam early led social scientists to explore the 
determinants of men's paths into the battle. The Selective Service System first 
drew the attention of social scientists (Davis & Dolbeare 1968, Wamsley 
1969), followed by Baskir & Strauss's (1978) aggregate study of the paths 
that the generation of young men followed as they passed into and out of war 
service in the Vietnam era. If a nation is a complexly subdivided entity, 
technical criteria of military manpower policy have moral dimension. Most 
Americans apparently want a "representative" force to be called up for wars, 
with age and gender exclusions from the representation taken for granted 
(Eitelberg 1978, Cohen 1985, cf. Foster et al 1987). This American value is 
to a degree distinctive, resting as it does upon the particular senses of the 
liberal state and of mission that are part of a widely shared American belief 
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system. The ideological implications of a censcript military, for the relation- 
ship of government to citizenry, who see it as unrepresentative, are sharp, as 
was recognized in Vietnam. 

Later micro level studies, both retrospective and longitudinal, reveal that 
Vietnam military service was not distributed in any straightforwardly inequi- 
table fashion. Utilizing the National Survey of Labor Market Experience, 
Useem (1980) finds that even though Vietnam-era service shortly after the age 
of 18 was more likely among those wh» had parents of relatively low 
socioeconomic status, the relationship be-ween socioeconomic status and 
service was not linear but seemed to dedend upon the mechanisms that 
allowed young men from well-placed pavents to prolong their education. 
Using the same survey, Shields (1981), fincs that the key determinants of the 
probability of serving in Vietnam varied markedly for draftees and for those 
who volunteered, as well as for those of different races. Whites fought less 
often both because they served elsewhere as volunteers and because they were 
more prone to claiming and receiving medical exclusions for minor ailments. 

In part, Shields's discovery explains the somewhat counterintuitive find- 
ings of Fligstein (1980), Berryman (1988], and Card (1983). Fligstein and 
Berryman each use large national samples of retrospectively derived data to 
examine the backgrounds of men who served in World War II, Korea, and 
Vietnam, finding in each case forces less unrepresentative than is sometimes 
alleged, although Vietnam differs somewhat. In a study based on longitudinal 
data that uses the 1960 Project Talent survey of high school students as its 
base, Card indicates that (with the exception of the less-educated, low-IQ 
students) military service during the Vietnam era was essentially equitable, 
but service in Vietnam, and especially in combat, was not spread evenly. 
People subject to the draft, rather then those who had a choice in tour of duty, 
were more likely to see front-line duty in Vietnam. If war service temporarily 
effaces many of the social distinctions of cizilian society—and this is a part of 
its theoretical interest to sociology— Vietnam did not do so entirely. If service 
in World War II provided some soldiers with a basis for transforming their 
civilian roles (Modell et al 1989), the American military experience in Korea 
may have more nearly reinforced prior standing (Biderman 1963). 

Accounts employing individual-level daa often cannot address changes in 
the ways society is organized, focusing ezclusively on shifts of individuals 
within it. Macroscopic accounts of war's inapact have in recent years been few 
in American sociology, but political scientsts and historians have contributed 
significantly to the discussion. Marwick's comparative study (1974) set the 
tone for many social-historical analyses (Becker 1985, Kocka 1984, Kennedy 
1980), but research techniques and precise zonceptualization have been great- 
ly enhanced since his pioneering work (e.z. Smith 1986). For example, the 
Great War, as Winter (1986) shows, had the impact of mitigating, both in 
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practice and ideal, some of the more glaring costs to the less advantaged of the 
British class system, without weakening the system itself (see also Wall & 
Winter 1988). Despite some redistribution of privilege and considerable 
change in the role of the state, the overall assessment of these social-historical 
accounts is of only minor restructuring of class lines, even in defeated 
nations. 

Enloe's significant Ethnic Soldiers argues that states, in waging wars, often 
elect to call upon groups ordinarily excluded from full participation in nation- 
al society on the grounds of ethnic difference (see also Smith 1981, Halloran 
1987). Her account explores the relations between political and military 
elites, between states and their populations, and between the class and 
"ethnic" dimensions of stratification. She argues shrewdly that even if war 
often integrates subpopulations, it does not do so by its nature, as is often 
maintained. If this is to happen it depends on several conditions: “The military 
must allow upward mobility for all objectively qualified members. . . . The 
system of service should be universal as opposed to selective. . . . There must 
be complete mixing in various sub-units, resisting the temptation to organize 
special units composed of one ethnic community. . . . Soldiers of all groups 
should be trusted enough to be assigned to the most strategically sensitive 
roles (Enloe 1980: 202-203). 

Similarly, war may affect the social implications of gender, a perspective 
often overlooked when one studies soldiers. Higonnet et al (1987: 41) main- 
tain that: “war exposes the relationship between women and the state, changes 
the material role of women, and therefore necessarily redefines the relation- 
ship between the rhetoric of gender and the gender-specific assignment of 
tasks. It eventually makes possible a new consciousness of gender discourse 
as social construct." Elshtain (1987) reminds us, in a highly personal analysis, 
that war is political in the deepest sense. She argues that in most modern 
societies, war and peace is an ideological dichotomy that is mapped homolo- 
gously to that relating male and female. Ideologically, the condition of war 
substantially enhances the salience of the male realm, with manifold ramifica- 
tions through the society for years afterward. Deploying a wide assortment of 
contemporaneous social-science studies, Campbell (1984) indicates that even 
if World War II did not have the kind of impact on gender roles that some 
feared it might, the impact on women's experience in a number of realms was 
such that the war promoted change, overall. Even the social construction of 
sexual preference may shift on account of war. Bérubé (1990) argues that the 
development of new psychiatric screening guidelines by the military in World 
War II incorporated the concept of the homosexual as a discrete personality 
type in accordance with emerging models of psychoanalytic theory, thus 
using the induction process as a catalyst in the redefinition of sexual prefer- 
ence as a social category. The shift was from a conceptualization as "so- 
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domite,” whose specific acts were prosecu-ed by the military justice system, 
to a "homosexual," understood as one of a class of individuals who must be 
excluded from the armed forces. 

War reorders categories to the extent st does because the modern state 
portrays war as society's effort. One early consequence of Americans' dis- 
avowal of Vietnam was the abandonment of conscription and Selective 
Service, the manpower device that had been heavily freighted with the notion 
of national purpose, to be replaced by a volunteer force. The All-Volunteer 
Force has been recruited with an emphasis upon ordinary "occupational" 
values: pay, benefits, skills training, and tne like (Moskos 1988, Segal et al 
1983, Segal 1989). Recruiting for the AVF was understandably carried out in 
those locations where such inducements would have greatest effect: among 
graduating high-school seniors who did not intend to enter college im- 
mediately, and among blacks, most particalarly in times of high unemploy- 
ment (Janowitz 1983). The transformation to an All-Volunteer Force follow- 
ing Vietnam prompted a good deal of sociologically relevant empirical 
research by manpower economists, including studies of the extent to which 
military manpower needs closely parallel and thus compete with those of 
civilian employment rather than complement them (Nelson 1986, Hosek & 
Peterson 1986, Black & Fraker 1986). In a model analysis of labor market 
processes as influenced by a large standirg military establishment, Mare & 
Winship (1984) suggest that one aggregate implication of this pattern of 
recruitment has been to reduce the impact on occupational distribution of 
blacks’ recent gains in formal education. While the analysis of Mare & 
Winship depends on exceptional data, we night well hope that their analytic 
framework may be turned to other histor cal circumstances, particularly to 
wars and their consequences. 


VIETNAM SERVICE AND PERSONAL CONSEQUENCES 


A revealing meta-analysis of the literature on psychological effects on soldiers 
of the Vietnam War (Kaylor et al 1987) indicates that more recent studies, by 
comparison with those carried out shortly after the conflict, have typically 
found more indications of adverse psychclogical effects. Since the relation- 
ship of the veterans’ ongoing “working through” of their own experience in 
Vietnam is (politicized and) part of the process of all Americans working 
through their nation's experience in that var, the "effect size" is not a fixed 
target: It will move historically, perhaps ir. part because of the publication of 
just such studies. “Rarely before have many claimants presented themselves 
to psychiatric examiners having read princed symptom checklists describing 
the diagnostic features of the disorder for which they seek compensation. This 
is not an uncommon event with posttraumatic stress disorder, on which 
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brochures are distributed by national service organizations, outreach centers, 
and others" (Atkinson et al 1982:1119; see also Sparr & Atkinson 1986). The 
inclusion of diagnostic criteria for post traumatic stress disorder (PTSD) in the 
mental health professions' official diagnostic handbook, Diagnostic and Sta- 
tistical Manual of Mental Disorders DSM-III in 1980 (refined six years later 
in DSM-III-R) has itself been a part of the effort to comprehend Vietnam (Foy 
et al 1987, Scott 1990, cf. Bogacz 1989). The coevolution of PTSD as a 
social phenomenon and a psychological speciality focusing on stress is an 
inescapable fact (e.g. Camp et al 1988), and the celebration of mediocre 
longitudinal studies of stress reactions of World War II veterans (Archibald et 
al 1962, Archibald & Tuddenham 1965) a striking scholarly concomitant. 

Many returning Vietnam veterans apparently felt a lack of closure of the 
morally confusing phase of life just completed, partly because of the per- 
ceived censoriousness of their fellow citizens. Based on the Legacies data set, 
Boulanger & Kadushin (1986) report that the path from trauma in Vietnam 
leads to adverse psychological reactions independently, through material 

` readjustment difficulties, and through the perception of an unwelcome return. 
Laufer et al (1981) find differences in gross psychological distress according 
to whether Vietnam veterans returned before or after 1968, at which point 
(they speculate) many in the homefront population had attached an aura of 
distaste, even disgrace, to the war and its unfortunate soldiers. Boulanger & 
Kadushin (1986), using different indicators but the same sample, do not 
concur, however, and Card (1983) finds no connection between date of 
service and psychologically distressful outcomes, although she does find a 
notable difference in economic success along this dimension. The actual 
process of reincorporation into civilian society after Vietnam, so plausibly a 
key element in the structuring of the life courses of those who fought, has 
resisted the relatively generalized analytic tools that sociology tends to utilize. 
Reincorporation has not yet found its ethnographer or historian. Indeed, the 
structuring of veterans' reincorporation into American society has been well 
examined only from a policy perspective (Levitan & Cleary 1973, Ross 1969, 
cf Waller 1944, Havighurst et al 1951). 

An important contribution of the Legacies of Vietnam group was an initial 
estimate of the proportion of veterans afflicted with PTSD, as well at the 
extent to which the lives of PTSD sufferers reflected the disorder. Kadushin et 
al (1981) found that almost one in five Vietnam veterans, regardless of extent 
of combat experience, experienced something resembling delayed stress reac- 
tion in tbe year following discharge, and that the proportion grew consider- 
ably as time passed. All wars, Kadushin and his colleagues imply, not just 
those renounced by the nation, can disturb subsequent lives in like fashion. 
Boulanger & Kadushin (1986:8) also maintain that stress symptoms were 
positively correlated with combat-group cohesion, a significant finding for 
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those who would link performance of war roles to the subsequent life course, 
and “one neglected by myths that emphasize the heroism of combat and the 
effectiveness of group cohesion." Laufer and his Legacies associates, by 
contrast, were particularly concerned to distinguish Vietnam from prior 
American wars. In subsequent articles, Laufer and his group examine differ- 
ences between most of the men in combat and the (rather small) subset of 
soldiers who reported witnessing or participating in what amount to atrocities. 
These, they argue, were uniquely correlated with the development of PTSD 
symptoms, especially for blacks, who were less able than whites to avoid 
confronting the humanity of their nonwhite victims (Laufer et al 1984, 1985, 
Frey-Wouters & Laufer 1986, cf Kadushin & Boulanger 1987). 

Card (1983) was able to find reasonatle measures of all of the PTSD 
diagnostic criteria and estimates that 19% of Vietnam veterans, as compared 
to 13% of other veterans and 12% of mon-Veterans, seem to have had 
PTSD-like symptoms by 1981. Among Vétnam veterans, only 1 among 14 
preservice “predispositions” examined (lcw self-confidence) and only one 
among 11 military-adjustment variables (iquor consumption) were signifi- 
cantly related to PTSD symptoms. But every one of ten combat-intensity 
measures was substantially related to PTSD. Card, unfortunately, does not 
really know much about men's wartime experiences, partly because these data 
(unlike the predispositions) were gathered retrospectively and were possibly 
colored by the reports of post-service well-being or distress, a methodological 
difficulty perhaps unavoidable (Williams 1984). Card examines eleven mili- 
tary adjustment items and ten aspects of military activity, but with almost all 
these, her focus is on trauma, pathology, ard combat effectiveness. She found 
out little about how successfully these men carried out the range of their 
military tasks, how successfully their units performed, or for that matter, how 
smoothly the men returned to civilian life. f 

By the late 1980s, the growing perception of a correlation between service 
in Vietnam and other health issues had attracted researchers utilizing con- 
ventional epidemiological techniques. Twc of the more notable attempts were 
the Vietnam Experience Study, (US Centers for Disease Contro] 1987, 1988, 
Boyle et al 1987) and the National Vietram Veterans Readjustment Study 
(Kulka et al 1988). The VES drew a random sample of about 8000 nonofficer 
Vietnam Army veterans who had entered the Army from 1965 ta 1971, and a 
roughly equal number of comparable non-Vietnam veterans from the same 
period. The NVVRS, a model of carefrl empiricism, followed the post- 
separation health, psychological and social adjustment, and psychological 
well-being of some 3000 male and female Vietnam veterans, non-Vietnam 
veterans, and civilian counterparts, through data gathered with exceptional 
efforts to assure representativeness. That prior aspects of servicemen's lives 
might have predisposed them to postwar distress was examined with more 
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breadth than Card and others, although these data are, unavoidably, retrospec- 
tive self-reports. 

The NVVRS took special pains to measure the prevalence of PTSD accur- 
ately, in line with its Congressional mandate. Responding to the convergence 
of prior literature upon the centrality of combat as a stressor, the study 
carefully derived sound (and, initially, before combination into an index, 
differentiated) measures of exposure to-war violence, successfully validating 
these self-reports against objective, inferential measures of exposure. 
Employing a multiple-indicators method, and a variety of validation tech- 
niques, the study found that of all male Vietnam veterans, about three in ten 
experienced the syndrome at some time or other, and that about half this 
proportion were currently suffering from it when the study went into the field 
in 1987. This proportion was about six times as high as for contemporary 
veterans of other theaters, and about twelve times as high as for nonveterans 
of the same ages. Both PTSD and a number of other postwar problems were 
far more prevalent among veterans exposed to high levels of military vio- 
lence. The contrast between PTSD prevalence among Vietnam veterans and 
others was greatly heightened in the NVVRS, as compared with other studies, 
because an explicit—and, arguably, almost war-specific—indicator of stress- 
ful event was included among the criteria for PTSD, in keeping with the 
Diagnostic and Statistical Manual of Mental Disorders (DSM). The NVVRS 
multivariate analysis was "consistent with a model of PTSD that posits a role 
for individual vulnerability (potentially including biological, psychological, 
and sociodemographic predisposing factors) and a role for exposure to en- 
vironmental factors (specifically, war zone stressors). . . . It is also clear that 
exposure to war zone stress makes a considerable contribution to the develop- 
ment of PTSD in war veterans that is independent of a broad range of potential 
predisposing factors" (Kulka et al 1988:40). Those less-advantaged were 
more vulnerable, and more so, those who had had psychological or behavior 
problems before adulthood. So were men who served in Vietnam when 
young, and those who rose the least high in pay grade while in the military. 
Those more afflicted more often joined veterans' organizations. Were these 
relationships the result of differing psychological resources, or of different 
social and material resources in the service and upon demobilization? 

So far, post-service mortality has been the most fully reported aspect of the 
Vietnam Experience Study which has also reported on physical and psycho- 
logical well-being. The key finding is that veterans who had served in 
Vietnam, as compared to both non-Vietnam veterans and non-veterans, suf- 
fered a significantly elevated mortality through external causes in the first 
several years after discharge, most especially through automobile accidents of 
all sorts and through poisoning, perhaps drug overdoses. (Breslin 1988 found 
the same association of service in Vietnam with automobile and poisoning 
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fatalities, especially shortly after the war, further finding that such effects 
were present both in members of the Army and of the Marines, although the 
former büt not the latter contained a high p-oportion of conscripts). In view of 
the-virtually self-inflicted nature of the excess mortality; an'inner wound is 
suggested.’-Some VES covariates interacted with the Vietnarn/other-theatre 
difference in military experience, but othe-s did not. Vietnam service made a 
special difference to the subsequent mortality of draftees more than to 
volunteers, to men who served before 1970 more than to those who served 
later, and to men in noncombat military occupational specialties. But Vietnam 
service was no more deleterious to those who served young than to those who 
served at older ages, or to those with higher rather than lower aptitude scores. 
The characteristics that mattered were those relating each soldier to the- 
military’ The second, noninteractive set, indexed aspects of the young man’s * 
positiori in the extramilitary social structure. And the interactions for differen- 
tial mortality appeared only among those whose service was not in Vietnam. 
The special relevance of these “military” variables to subsequent mortality 
was, in fact, erased by service in the war zone in Vietnam, which brought all 
soldiers who served there up to similarly elevated levels of mortality. Draft- 
ees, men serving before -1970, and nontactical soldiers had similar levels of 
post-separation mortality whether or not -heir service was in Vietnam. 

The health impact of war, at the indivicual level, still lacks z sociological 
analysis. War's various wounds are enumerated but never analyzed in their 
aggregate. Veterans are rarely seen in any substantial social context, and the 
substance of men's military careers is largely ignored (but see Cockerham 
1978a,b, 1979) apart from combat stress. Some clue to the value of develop- 
ing such considerations emerges in an epicemiological study of World War II 
survivors, carried out by Seltzer & Jab.on (1977, and see Keehn 1980, 
exáfilining American prisoners-of-war in World War II and Korea). Among 
the veterans, the usual socioeconomic determinants of mortality operated, but 
so, too, did military experience, and among the variables that counted was 
military rank attained. Rank was correlated with educational level, which was 
interactively related to postwar mortality: where educational level was high- 
est, the impact of rank mattered most, and where educational level was 
lowest, rank mattered least. 


MILITARY SERVICE AND ECCNOMIC PERFORMANCE 


One of the most active areas of study in recent years has been that exploring 
the economic ramifications of military service in the years after discharge. 
Martindale & Poston (1979:229) employ -970 census microdata to present a 
broadbrush study of the impact of servics in each of several wars on 1969 
income (see also Cutright 1974; Poston 1979). Their results indicate that, as 
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of 1969, war-era veterans generally tended to have higher incomes than 
nonveterans of like age and race, in part because veterans receive higher 
additional returns per year of formal educational attainment. The authors 
speculate that “this difference may well-be(due to characteristics of the 
bridging environment.to which the. veterans were exposed while serving in the 
military.” But white soldiers who served during the first part of the Vietnam 
conflict were “losers,” at least in 1969. 

The most extensive detail on the Vietnam life-course sequence is available 
in the Rothbart et al (1981) Legacies volume. The authors find that many 
Vietnam «veterans were somewhat disadvantaged in the labor;market by 
comparison with non-Vietnam veterans and nonveterans, primarily: because 
they were slower than others to return to school. Did: these Vietnam: veterans 
fail to choose prompt schooling upon return because they were restless, 
plagued by disturbing memories, apathetic, angry? We.do not learn here. 
(The Legacies data analyzed by Laufer«et al (1985) also show that veterans of 
‘Vietnam married at somewhat younger ages than did non-Vietnam, veterans or 
nonveterans, and had a lower likelihood of divorce, despite their younger 
marriages.) Military participation in the war is a black box here, too, although 

#a_scale of combat intensity was not significantly related to either educational 
or occupational attainment. 

The war's lasting impact differs:according to race, in a complex fashion. 
Unlike whites, black veterans (including? Vietnam veterans) attained more 
postwar education than nonveterans and held higher-status jobs, having 
emerged from their service with relatively enhanced or intact ambitions for 
the near-term future. Ironically, they also (especially Vietnam veterans) were 
more likely to be:unemployed, possibly for reasons related to the way public 
evaluation of the Vietnam war intersected with racial:and-class categories. 
Two early-1980s studies employing random assignment experiments, one 
carried out with middle managers and the other with personnel officers, 
suggest that potential employers perceived Vietnam veterans more negatively 
than they viewed other young men, otherwise identical, who had not served in 
Vietnam. In one study (Bordieri & Drehmer 1984), hypothetical descriptions 
of job candidates described as Vietnam veterans were less likely to elicit 
irecommendations. In the other (D' Anton 1983), discrimination on the basis of 
Vietnam service was limited to whites with little education. 

Goldberg & Warner (1987:78) employed Social Security earnings records 
to specify the training and background circumstances of those whose incomes 
benefited the most in the six years following separation. They conclude that 
"military experience is a close substitute for civilian experience in precisely 
the occupation groups where training appears to be the most transferable: the 
Electronics Equipment Repair, Medical, [and] Electrical/Mechanical Equip- 
ment Repair." Fredland & Little (1980) employed the National Longitudinal 
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Survey of Labor Market Experience to study a World War II cohort and found 
similar results. Cohen et al (1986) took advantage of a 15-year follow-up of 
late 1950s white male high school studeats to discover how Vietnam-era 
military service fit into educational careers (see also Marcus 1984; Jackson 
1986). Controlling to avoid measuring the correlates of the personal charac- 
teristics that actually may have determined recruitment to service in Vietnam, 
they find that on average enlisted men, net of other characteristics, lost two 
thirds of a year of formal educational attainment, but officers lost nothing. 
Among enlisted men, some educational gain was experienced by those who 
had entered service the most educationally deprived, for these men often 
achieved a high school diploma within the service. On the whole, they judge, 
the overall negative educational impact cf wartime service was something 
new in recent history. "[T]he transformation of military service from the 
positive to a negative educational agent may have affected recruiting adverse- 
ly insofar as this change was recognizec by potential enlistees." Mattilla 
(1978) offers one macroscopic reason that the life-course impact of Vietnam 
service may have been different from that of World War H and Korea. The 
Vietnam era GI Bill, was, in constant dollars, no less generous than that for 
World War II, but, relative to those opportunity costs incurred by studying, 
had become markedly costlier to take up. 

In a small way, the scholarship reviewed in this section is moving toward 
incorporating military service into a fuler life-course perspective, one, 
moreover, in which a social-historical context of the service might readily be 
incorporated in the analysis. No one has more fully embraced this perspective 
than Glen H. Elder, Jr. (Elder 1986, 1987, Elder & Bailey 1988, Elder & 
Caspi 1991, Elder & Clipp 1988, 1989, Pevalko & Elder 1990; for a German 
parallel, see Mayer 1988). Most of Elder’s work on the life-course impact of 
World War II rests on longitudinal data sets. originating in child-study projects 
at the University of California before the war. The cohort samples under 
examination are small, but were frequently observed as children and as 
adolescents, as well as since the war. Elde- has achieved a special familiarity 
with his data and has built well-constructed indices, including uniquely full 
measures of psychological states at particular times in his subjects’ develop- 
ment. (Unfortunately, the retrospective data on individuals! wartime experi- 
ence was elicited only after a great interval). 

Elder is concerned with delineating tne processes by which relatively 
disadvantaged individuals often came out head of where they might have, by 
virtue of their war service. He also strives to elucidate some of the costs to 
individuals of war service, particularly the costs of undergoing the trauma of 
combat. Each focus treats individual serv.cemen as active agents, and each 
analyzes considerably more complicated life-course paths than is characteris- 
tic of most of the micro-level studies of tae impact of war. Elder has so far 
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been cautious about attempting macroscopic evaluation of the war's impact on 
American society; his preferred outcome measures are firmly anchored in 
behaviors and psychological characteristics of individuals. 

Many initially disadvantaged men, Elder shows, entered military service at 
younger ages, often enlisting out of an immaturely impulsive patriotism, but 
also at a life-stage that was less disruptive to their careers than for their 
better-off fellows who entered later. Combat veterans in particular, Elder 
demonstrates, emerged from the war with a lessened sense of helplessness, 
and with an enhanced sense of assertiveness and resilience. After the war, 
many of these combat veterans were more capable, more active, and more in 
control of their own lives. These men, on the other hand, were less likely to 
pursue education after the war than were other veterans. Those of Elder's 
subjects who reported great wartime stress were the very men whose adoles- 
cent psychological makeup had been less resilient. Since they were also more 
likely to attach themselves to a "buddy," they often saw a close friend lose his 
life, and this was deeply traumatic, setting these men off from all their 
fellows. They were in various ways subject to postwar psychological buffet- 
ing, bearing analogy to PTSD. These same men were preeminently the men 
who still continue actively to retain wartime friendships, to attend reunions, 
and to value their wartime experiences in a way that suggests they may still be 
working through the war, just as many Vietnam veterans are. 


TOWARD A COHERENT ANALYTIC FRAMEWORK 


The social impact of war merits systematic treatment by social scientists but 
has not received it. Many substantial empirical contributions have added to 
our understanding of aspects of this subject, but these have rarely been well 
enough integrated in the literature to bring them to bear on one another. 
Rather, the currency of particular elements of war has led to the sporadic 
production of ad hoc, rather than theoretically directed, work. We propose 
here a loose interpretive framework, resting on the proposition that war 
represents, at once, a temporary reorganization and rejustification of the role 
structure of the society, and a shuffling (often substantial) of the individuals 
who occupy those roles. War, of course, is far too varied an activity to allow 
easy sociological generalization. The means by which governments wage war 
differ sharply in the military manpower systems employed, in their rela- 
tionships to peacetime military manpower, and to the civilian economy, and 
to how war's dislocations are intentionally buffered by the state (Pinch 1982, 
Moskos 1988, Segal 1989). For present purposes the relevant dimension on 
which they vary is the extent to which they ask people in general (and 
individuals in particular) to become something that they have not previously 
been. While much interest has focused on the reorganization of personality in 
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the face of war, it ought to be understood in. interaction with the less traumatic 
modifications in the role reorganization thet war causes. Not only must new 
members for the military (with particular skills or capacities) be secured from 
the civilian population, but new, often speedier, paths must be devised within 
the military for filling critical roles that cannot be filled lateraily from the 
civilian population. Recruitment, training, ransfer, and promotion within the 
military, as well as changes in war itself, are highly consequential for the 
structuring of military forces and military careers, and thus for the impact war 
will bave when hostilities are over (Modell & Steffey 1988, Hauser 1987, 
Binkin 1986, Binkin & Kyriakopolous 1979). 

How careers are formed in the military zs a product of the particular tasks 
performed there, and these are considerably modified by war. The immense 
dynamism of war provides large numbers »f individuals (not randomly cho- 
sen) with experiences in organizational roles to which they would never 
otherwise have been exposed. Most obvicusly, combat roles become more 
prominent in a military in war, but so do lozistical, intelligence, and teaching 
roles. If the homefront roles change less du-ing war than those in the military, 
this lack of situational novelty is only relative. The absence of soldiers and the 
demands of war production modify who holds what homefront roles. 

Although little has appeared in our review of the recent literature, the 
demography of war is another relevant theme. Demographic structures mod- 
ified by war may, in turn, influence the way survivors move through their 
own postwar life courses. Wartime role chenges, in turn, can be influential in 
the long run, according to the mechanisms that evaluate the individual's 
wartime activities in determining postwar social placement, as the distribution 
of roles moves toward a return to that of the prewar period. Only with regard 
to women’s work, and then mainly for World War II, do we have much sense 
of how the changed configuration of roles when war ended affected the way 
women's extraordinary wartime activities vere evaluated in the postwar labor 
market and elsewhere in women's lives (Campbell 1984, Hartmann 1978, 
Milkman 1987). Despite work that treats "he elements separately, we know 
little about how war has produced the chznging interrelations among labor- 
force participation, family formation, and formal education, in persons who 
adopted new roles during the war (Model 1989). We have remarked how 
seemingly important formal education has been for men leaving the military 
after war; our sense of just how this works is quite imprecise, as Nam's (1964) 
careful review shows. i 

Finally, we must.consider the more sobering aspects of the consequences of 
war: the creation of the “veteran” role aad the infliction of war wounds, 
physical, psychological, and “hidden.” Despite a plethora of professionals 
whose occupational assignments direct their attention to veterans, and despite 
some excellent scholarly work on policies toward veterans, little: has been 
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done to assess what kind of a difference it makes to the individual in his 
civilian capacity, or to the society of which he is a member, when he is a 
veteran (cf. Jennings & Markus 1977, Schreiber 1979, Janowitz & Wesbrook 
1983). In 1970, over three million Americans were receiving VA disability 
support. Is this category sociologically nugatory? If we move beyond attested 
disabilities to psychological wounds, we must recognize that we are dealing 
with an aspect of the way individuals understand themselves that is socially 
constructed, and that war is not something solidly in the past and influencing 
present behavior but is, rather, dialectically tied up with present concerns, and 
is something both highly personal and part of a very public debate. 
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Abstract 


For reasons of cost and ease of implementation, mail surveys are more 
frequently used for social research than are either telephone or face-to-face 
interviews. In this chapter, the last two decades of research aimed at improv- 
ing mail survey methods are examined. Discussion;of:this research is orga- 
nized around progress made in overcoming four important sources of error: 
sampling, noncoverage, measurement, and nonresponse. Progress has been 
especially great in improving response rates-as a means of reducing nonre- 
sponse error. Significant progress has also been made in finding means of 
overcoming measurement error. Because mail surveys generally present few, 
if any, special sampling error problems, little research in this area has been 
conducted. The lack of research on noncoverage issues is a major deficiency 
in research to date, and noncoverage error presents the most significant 
impediment to the increased use of mail surveys. The 1990s are likely to see 
increased research on mail surveys, as efforts are made to incorporate mail 
into cost-effective mixed mode designs. 


INTRODUCTION 


More sample surveys are done by mail than by any other survey method. An 
examination of all federal surveys approved by the US Office of Management 
1225 
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and Budget and active in August 1981 revealed that nearly 8 out of 10 utilized 
self-administered questionnaires. Of these 2137 surveys 69% were conducted 
solely by self-administered procedures, versus 2.2% solely by telephone, and 
9.4% solely by face-to-face interviews. An additional 11% involved self- 
administered questionnaires in combinatiom with one of the other methods. 
The report concluded that at least 90% of those categorized as self- 
administered surveys were implemented by using mail procedures (US Office 
of Management and Budget 1984). 

Two major reasons for the extensive use of mail survey methods are, first, 
there is the much lower cost for completing -hem; second, procedures for mail 
surveys are often deemed simple enough taat individuals and organizations 
conduct their own rather than relying upon survey research organizations. If 
viewed from the standpoint of large-scale general public surveys done through 
national survey organizations, there is little doubt that telephone surveys are 
now the nation's most used method for la-ge-scale general public surveys. 
However, when surveys of all populations. including those implemented by 
professors, graduate students, government, schools, and businesses, as well 
as the national survey organizations are included, the prevalence of mail 
survey procedures is evident. 

The literature on mail survey methods is voluminous. A bibliography in the 
mid-1970s on research to improve mail survey procedures included nearly 
250 entries (Dillman 1978). A more recent bibliography, including only items 
published since 1970, compiled in preparat:on for the writing of this chapter, 
included more than 400 entries (Dillman & Sangster 1990). 

Our purpose here is selectively and criticelly to review major developments 
in the design and administration of mail surveys during the last two decades, 
as they are conveyed by this literature. Impcrtant issues for future research are 
also identified. 

Much but by no means all of the literature has been authored by sociolo- 
gists. Sociology is only one of many disciplines that depend on data collected 
by mail surveys for the scientific advancement of the discipline. Statistics, 
psychology, marketing research, economics, and the various health sciences 
are disciplines in which research efforts to improve mail survey methods are 
regularly conducted and reported. In this review we are less concerned with 
the origin of particular research than its rele~ance to improving the quality and 
quantity of response to mail surveys, and im particular to resolving the major 
error issues. To these issues we now turn 


PERSPECTIVE TAKEN IN THIS REVIEW 


The sample survey is distinguished from cther research methods frequently 
used by sociologists by its ability to estimate quantitatively the distribution of 
a characteristic in a population, and to accomplish this by obtaining informa- 
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tion (answers to questions) from only a small proportion of that population, 
usually a few hundred or thousand. Thus, one can learn from a sample survey 
such things as what percent of a city's residents own the home in which they 
live, what proportion of state government employees are satisfied with their 
jobs, and the extent to which residents of large cities have greater fear of 
crime than do residents of small cities. 

To generalize the results for the sample to the population from which it is 
drawn, one must normally contend with at least four potential sources of 
error, any one of which may make the survey results unacceptable. These 
sources of error include: (a) sampling error, (b) noncoverage error, (c) 
nonresponse error, and (d) measurement error (Groves 1989). Efforts to do 
quality mail, or for that matter telephone or face-to-face, surveys require that 
we attempt to eliminate, or at least reduce, all four types of error. Each kind 
of error describes a reason for the discrepancy between the population attrib- 
ute measured and the estimate derived from the sample survey. 

Sampling error results from heterogeneity on the survey measures among 
members of the population. It is attributable to the fact that certain members 
of the population are deliberately excluded by selection of the subset of 
members for which responses were obtained. In practice, sampling error is 
mostly a reflection of the number of respondents surveyed and provides the 
basis for statements that one frequently hears such as, "This randomly drawn 
sample of 400 respondents provides results that are accurate within 5 percent- 
age points." In general, quadrupling sample size decreases sampling error by 
one half. Sampling error is the aspect of survey quality examined through 
inferential statistics applied to sample survey results, from which conclusions 
about significant differences in the population are obtained. 

The principles for sampling were, for the most part, developed early in this 
century. They have presented particular challenges for face-to-face interviews 
where multistage cluster designs are typically used to reduce costs, and for 
telephone procedures where methods for efficiently selecting random digits to 
access households had to be developed. Mail surveys typically rely on lists for 
which simple random or simple systematic sampling procedures are used so 
that very little research on ways of reducing sampling error in mail surveys 
has been done. For this reason, sampling error receives little attention in this 
paper. 

Noncoverage error arises because some members of the population are not 
covered by the sampling frame and therefore have no chance of being selected 
into the sample. Reducing this source of error is an important research topic 
for mail surveys. Noncoverage error is one of the major reasons that mail 
surveys have not been as useful as desired in surveying the general public. 
Because of the small amount of mail survey research in this area we shall 
consider it only briefly, although in an important way, in this paper. 
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Nonresponse error stems from the fact that some of the members of the 
sample population do not respond to the survey questions. The vast majority 
of research on improving mail survey methods has focused on response rates, 
the generally accepted indicator of nonrezponse error. This near singular 
focus on response rates as a means of improving results to mail surveys occurs 
because low response has long been considered the major problem of mail 
surveys. Just as important, however, is the perception fostered by much of the 
literature that such research could be successful in identifying means of 
improving response rates, in contrast to the imited opportunities for overcom- 
ing noncoverage error. Thus, nonresponse error is the major focus of this 
paper. 

Measurement error refers to the discrepancy between underlying, un- 
observed variables (whether opinions, behaviors, or attributes) and the 
observed survey responses. Whereas the three preceding types of error (sam- 
pling, noncoverage, and nonresponse) stera from nonobservation, measure- 
ment error results from the process of observation. In practical terms, 
measurement error results from respondent characteristics, e.g. their inability 
to provide accurate information or a mctivation for whatever reason to 
provide inaccurate information. Measurement error may also result from 
characteristics of the question (e.g. a question phrased so that it cannot be 
answered correctly) or of the questionnaire, {e.g. the order in which questions 
are presented). 

In recent years, there has been a great deal of interest in mixed mode 
surveys, i.e. data collected by two or more methods for a single survey 
purpose (Dillman & Tarnai 1988). This interest, propelled by rising costs and 
a recognition that different members of a single household may be accessible 
to one method but not another, forces an explicit recognition of the survey 
mode itself as a source of measurement error. The fact that mail surveys 
depend upon visual communication and tie. ability of people to read and 
write, that telephone surveys depend on aural communication, and that 
face-to-face interviews may involve all of the senses raises the specter of 
different methods producing different resuks. Thus, it should not come as a 
surprise that measurement error is a rapidly expanding topic of research for 
mail as well as other types of surveys. 

A good sample survey, by whatever metnod, is one in which all members 
of a population have a known opportunity to be sampled for inclusion in the 
survey (noncoverage error is avoided); the people to be surveyed are sampled 
by random methods in sufficiently large nurabers to provide a desired level of 
precision (sampling survey error is limited); questions are selected and 
phrased in ways that result in people providing accurate information 
(measurement error is avoided); and everycne who is included in the sample 
responds (nonresponse error is avoided). Seidom if ever does a sample survey 
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accomplish all of these goals. However, this multifaceted concern becomes a 
standard by which progress in the development of mail survey methods can be 
measured. 


REDUCING NONRESPONSE ERROR 


A low response rate does not necessarily entail nonresponse error, i.e. a 
discrepancy between the frequency of~a population characteristic and that 
estimated by the survey that occurs because some people did not respond. 
Those who respond to a survey may not differ in any measurable way from 
those who do respond. However, the usual reason that surveys are com- 
missioned is that the distribution of certain population characteristics is 
unknown and a survey becomes the tool to find out the distribution of those 
characteristics. Thus, it is usually impossible to compare respondents with 
nonrespondents on precisely those variables of most survey interest. For 
populations in which a few characteristics of the completed sample are known 
(e.g. grades and age for a student sample), limited respondent-nonrespondent 
comparisons can sometimes be made. However, even when one can be 
confident that no differences exist on these variables, one still does not know 
whether differences exist on those variables of interest that led to the decision 
to conduct the survey. It is for this reason that response rate has de facto 
become the accepted proxy for nonresponse error. The general assumption is 
that the higher the response rate the lower the potential of nonresponse error 
and therefore the better the survey. 

Low response rate, more than any other issue, has given mail surveys a 
poor image among social researchers. Prior to the 1970s, mail surveys were 
generally seen as producing response rates unacceptable for serious sociologi- 
cal research (Dillman 1978). Thus, it is not surprising that the vast majority of 
mail survey research has, and continues to be, focused on this issue. 

The research literature is dominated by hundreds of tests of specific pro- 
cedures and techniques for improving response rates, typically manipulated 
one or two at a time. Examples of techniques tested include: financial 
incentives, material incentives (e.g. ballpoint pens), follow-up reminders, 
timing of follow-ups, personalization of correspondence, anonymity of re- 
sponse, questionnaire layout, questionnaire length, color of questionnaire, 
type of outgoing postage, type of return postage, content of cover letter, 
source of survey sponsorship, and higher rate of postage (for example, see 
Dillman 1978). . 

Most such articles are in themselves quite limited in what can be learned 
about how to improve response rates. Studies differ dramatically with respect 
to the population that is surveyed, topic, length of questionnaire, and es- 
pecially procedures other than the test procedures used to achieve a higher 
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response rate. Thus, when one learns thar prior notice is found to increase 
response rate in a survey in which no follow-up is used, it is impossible to 
deduce what would happen if several follcw-ups (now, a generally accepted 
procedure) were to be employed. Also, if one finds that the color of stationery 
increases responses from 25% to 30%, still quite a low final response rate, 
one is left wondering whether such a diffzrence would also be observed if 
additional follow-ups, personalized correspondence, or other techniques were 
used. Would these results be replicated with at least a 50% response rate in the 
control group? To put it another way, wo.ld a technique that was found to 
make a difference in conditions that led io a low response rate work less 
effectively under conditions that led to a higher response rate? 

The last two decades have witnessed thrze distinctively different efforts to 
move beyond the limitations of these individual studies. Each effort attempts 
to assess results from a large number of previous efforts and to ascertain how 
response rates can be improved. We label -hese efforts here as (a) subjective 
assessments, (b) quantitative assessments, and (c) design of comprehensive 
systems. 


Subjective Assessments 


In 1961, Christopher Scott reported a lergthy and detailed comprehensive 
review of existing research, intended to reach conclusions about which tech- 
niques could be reasonably expected to improve response rates and which 
could not. His major conclusion was that follow-up mailings were most 
effective, a finding reaffirmed by every cormprehensive literature review since 
that time. Other factors judged by Scott to make a positive difference included 
prior notice, financial incentives, special postage, sponsorship, stamps on the 
return envelope, personalization, and intesest in the survey topic (see Table 
1). More than a decade later, Linsky (1975), in another exhaustive review of 
the literature, reaffirmed the importance of all but one of these considerations 
(“interest in the topic" was not included im the review). Similar conclusions 
were reached by Kanuck & Berensen (1975), in a review that focused in large 
part on research conducted in the marketing research tradition, and Duncan 
(1979) in a review that emphasized management research. Techniques or 
study characteristics that did not make a di/ference, according one or more of 
these reviews, included nature of the questionnaire, deadline date, promise of 
anonymity, nature of the cover letter, anc questionnaire length. 

Another review, limited to the use of financial incentives, sought to specify 
the nature of incentives that worked best (Armstrong 1975). Only prepaid (as 
opposed to postpaid) monetary incentives appeared consistently to make a 
difference, and in general, nonresponse "was reduced by about one third 
through their use. Perhaps the most striking finding of these studies is that on 
virtually every topic (with the possible exceptions of follow-ups and financial 
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incentives), contradictory findings existed that could not be easily dismissed 
as the result of differences in study designs. 

Although these studies each made an important contribution to assessing 
systematically the relative importance of various factors influencing response 
rate, their conclusions were necessarily limited by the author’s subjective 
assessment of effects. In addition, each represented only a compilation of 
findings from previous literature; there was no effort to integrate them theoret- 
ically to determine which elements should be combined in what way to obtain 
the highest possible response rate. The limitations of this approach are 
explicitly stated in a quite recent review by Harvey (1987) done in this 
tradition. He reaffirms the importance of precontact, stamped return en- 
velopes, follow-ups, and economic incentives but concludes that very differ- 
ent factors may be at work in different situations and that these are probably 
not stable across populations. i 


Quantitative Assessments 


In 1978, Heberlein & Baumgartner published an innovative, quantitative 
assessment of procedures used in previous mail survey research. Two hundred 
forty (240) manipulations applied to 98 mail survey experiments were treated 
as if they were respondents to a survey. Seventy-one characteristics of each of 
these “respondents,” i.e. surveys, were coded; they included four general 
category characteristics: (a) research study attributes (e.g. type of sponsoring 
research organization); (b) sampling and sample characteristics (e.g. type of 
population such as students or government employees); (c) questionnaire 
characteristics (e.g. salience of topic); and (d) research procedures (e.g. a 
special third contact). These researchers then used a regression analysis 
procedure to predict final response rates, using the 71 characteristics as 
independent variables. 

Heberlein & Baumgartner found that number of contacts and salience of the 
topic explained 51% of the variation in the final response rate. A 10-variable 
model, including these and other variables, predicted 66% of the variation in 
the final response rate. The predictive variables included: market research, 
sponsorship, general population, and length (all negative effects); and govern- 
ment sponsorship, employee population, school or army population, special 
third contact, and incentive on the first contact (all positive effects). 

Goyder (1982) replicated this study on an extended sample of 330 surveys 
with similar results, except that the negative effect of market research organi- 
zation sponsorship disappeared. The negative effect of general population and 
the positive effect of employee population were also significantly less. An- 
other replication by Eichner & Habbermehl (1981), utilizing studies from 
Austria and West Germany, suggested potential cross-cultural differences. 

A major contribution of these quantitative assessments is to focus attention 
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squarely on structural constraints of doing surveys (e.g. sponsorship and 
population to be surveyed). Prior to this time research had mostly focused on 
identifying manipulable procedures the might be shaped to improve response. 
The identification of salience of topic as one of the most important determi- 
nants of response, along with the effects of structural constraints, made it 
clear that the potential for reducing nonresponse error varies considerably 
across types of surveys. One legacy of this model is a quantitative means for 
researchers to learn how response to a survey might be improved by making 
particular changes (e.g. adding an incentive or cutting pages from the ques- 
tionnaire), an idea eventually expanded further by Bruvold & Comer (1988). 
Heberlein & Baumgartner also added a new slant to response rate research by 
attempting to predict response to the initial mailing and then using that result 
to predict overall response rate; this idea usefully separated sources of early 
and late effects in the survey process. 

These quantitative approaches also exhibit certain unavoidable limitations: 
missing data (because authors of the original individual studies did not report 
details on nonexperimental factors), and subjective decisions on how vari- 
ables such as salience were to be coded. The quantitative assessment tech- 
nique is also limited in its ability to provide information about techniques not 
tried or used in only a few past studies. Further, the analysis is empirically 
driven, as is most of the past literature, so that the process of combining 
techniques becomes mechanical, devoid >f guidance from conceptions of 
why people behave as respondents or nonzespondents. Finally, as generally 
noted by the authors, regression analysis :s not a substitute for a very large 
factorial design in which many variables would simultaneously be manip- 
ulated. 

Fox et al (1988) developed a somewhat cifferent quantitative assessment of 
past studies in an effort to overcome some of the limitations of the regression 
analysis method. Their meta-analysis invelved accumulating statistical evi- 
dence against a null hypothesis across individual studies to determine.,an 
overall level of significance at which the hypothesis could be rejected. This 
study was, in many ways, less ambitious than that of Heberlein & Baumgart- 
ner as only nine variables were tested. The six shown to be significant 
included university sponsorship, prenotification by letter, postcard follow-up, 
first-class outgoing postage, and questionnaire color (Table 1). In contrast, 
notification of a cutoff date and a postscrirt to the letter were not significant. 
Some of their conclusions (e.g. questionnaire color) may be questionable 
because of the small number of studies available for review. Armstrong & 
Luske (1987) provide a similar meta-analysis with respect only to the use of 
postage and presumably show that applying postage to a return letter (versus 
business reply envelopes) makes a positive difference. 
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Design of Comprehensive Systems 


Efforts to design comprehensive systems with the explicit goal of maximizing 
response rates represent a quite different approach to the problem of reducing 
nonresponse error. Though informed by past research, such an approach does 
not seek to summarize it either statistically or qualitatively. Rather, an attempt 
is made to construct a system of procedures and techniques that will maximize 
response rate, in ways consistent with obtaining quality responses. 

An example of this approach is the Total Design Method (TDM) (Dillman 
1972, 1978, 1983, Dillman et al 1974). The TDM utilizes social exchange 
theory to guide the careful integration of specific procedures and techniques, 
many of which were found important by both the subjective and quantitative 
assessments just reviewed. The theoretical framework used in this approach 
posits that questionnaire recipients are most likely to respond if they expect 
that the perceived benefits of doing so will outweigh the perceived costs of 
responding. Thus, every visible aspect of the questionnaire development and 
survey implementation process is subjected to three design considerations: the 
reduction of perceived costs (e.g. making the questionnaire appear easier and 
less time-consuming to complete), increasing perceived rewards (e.g. making 
the questionnaire itself interesting to fill out by adding interest-getting ques- 
tions), and increasing trust (e.g. by use of official stationery and sponsorship) 
that the promised rewards will be realized (Dillman 1978, 1983). 

Among the consequent TDM design recommendations are the following: 
ordering questions to assure that interesting ones related to the topic described 
in the cover letter come first; use of graphical design and numerous question- 
writing principles to ease the task of reading and answering questions (e.g. 
consistency in the use of large [capital or dark] letters and contrasting small 
letters); printing the questionnaire in a booklet format with a topically neutral 
but interesting cover; use of photo reduction of regular sized type to make 
pages seem smaller and easier to complete; use of four carefully spaced 
mailings, including a postcard follow-up one week after the original mailing, 
a replacement questionnaire and cover letter informing the recipient the 
questionnaire has not yet been received four weeks after the original mailing, 
and a second replacement questionnaire and cover letter seven weeks after the 
first mailing, sent to nonrespondents by certified mail; individually printed, 
addressed, and signed letters; addresses printed onto envelopes rather than on 
address labels; use of smaller than usual business stationery to reduce costs 
and make the entire request appear smaller and easier to comply with; cover 
letter content that includes descriptions of the study's social usefulness and 
why the respondent is important (repeated in different ways in each of the 
subsequent mailings); explanation of identification numbers and how con- 
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fidentiality is protected; folding of outgoinz materials in a way that contrasts 
with advertising mail. Other details of the TDM, in particular how they are 
integrated to create a holistic effect, are described elsewhere (Dillman 1978, 
1983). 

The major strength of the Total Design Method as a comprehensive system 
is that meticulously following the prescribed procedures consistently produces 
high response rates for virtually all survey populations. Response rates typi- 
cally reach 50—7096 for general public surveys, and 60-80% for more 
homogenous groups where low education is not a characteristic of the popula- 
tion (Dillman 1978, 1983, Mullen et al 1287). Response rates greater than 
80% have frequently been reported (cf Crosby et al 1989, Dillman 1978). To 
my knowledge, no study that has utilized a 12-page or smaller bóoklet and 
followed the TDM in complete detail, from questionnaire through the full set 
of implementation procedures, has obtained less than a 5066 response rate. 
The TDM has also produced comparable response rates in Europe (Nederhof 
19832, b, 1985, DeLeeuw & Hox 1988), Japan (Jussaume & Yamada 1990), 
and Australia (Graetz 1985). 

Another strength of the TDM is that it is theoretically driven, removing 
emphasis from individual techniques and stressing how elements fit together. 
The emphasis is placed not on a particular technique (e.g. personalization or 
follow-ups), but on how these procedures can be linked to influence question- 
naire recipients positively. This theoretical guidance focuses the design upon 
projecting a consistent positive image of tbe respondent. Although the TDM 
incorporates many features found importan: in past research, it treats some of 
them differently. For example, Heberlein & Baumgartner’s work (1978) 
revealed that Jack of salience of the quesGonnaire topic was a constraint to 
high response. The TDM approach treats salience as a manipulatable variable 
and incorporates a number of efforts to increase salience of topics (e.g. 
questionnaire cover, question order, and addition of interest-getting ques- 
tion). 

The TDM also exhibits a number of significant shortcomings. First, there is 
a lack of empirical evidence that all, or evea most, of the dozens of prescribed 
details make a difference. Some details are particularly open to question: the 
size of stationery, size of the booklet, and method of folding questionnaires. 
Work by Jansen (1985) has questioned the usefulness of photo reduction and 
shown that a larger size can produce a high=r response rate. Limited evidence 
exists that the degree of overall adherence to the TDM significantly influences 
responses; the evidence is provided by a study in which research investigators 
located in 13 different states each admin:stered the identical questionnaire 
(which closely followed TDM procedures) to similarly drawn samples of the 
general public in their respective states. An “Adherence to TDM” index was 
calculated which scored each state on how many of the 22 implementation 
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details of the TDM were utilized. State scores on this index ranged from 6.5 
to 21.5. Response rates ranged from a low of 38 percent (of the question- 
naires sent in the original mailing) for the state with the lowest score, to 65.1 
percent in the state with the highest score. A Spearman rank-difference corre- 
lation of .81 confirmed that adherence to details of the TDM implementa- 

: tion process correlated closely with the final response rates (Dillman et al 
1984). 

Evidence also exists that certain specific features of the TDM make a 
significant difference as part of this comprehensive mail survey system. These 
features include follow-ups (Dillman et al 1984), the certified third follow-up 
(House et al 1977, Dillman & Moore 1983, Nederhof 1985a, b, and DeLeeuw 
& Hox 1988), follow-ups that inform people their questionnaire has not yet 
been received versus those that provide a deadline date (Nevin & Ford 1976), 
and personalization (Dillman & Frey 1974). 

The effect of graphic cover design has also been tested. Grembowski 
(1985) found that response rates were higher for a cover that had more 
salience to the questionnaire recipients. A graphical representation emphasiz- 
ing fluoridation as a means of protecting children's dental health performed 
better than a cover emphasizing reduction of dental costs in a survey of 
parents, all of whom who had dental insurance plans as part of their employee 
benefits, and for whom cost was probably not a major issue of direct concern. 
Nederhof (1988) conducted a study in which identical covers were developed, 
except that one questionnaire cover was made to stand out more than the other 
by filling open spaces with black. This "high contrast" cover was intended to 
be more memorable and thus easier for the questionnaire recipient to recall 
and retrieve it. His hypothesis proved correct in a survey of Dutch biotech- 
nologists, with 75.646 of the recipients returning the black-dominated ques- 
tionnaire versus 60% who returned the low contrast (or white) version. More 
importantly, the entire difference was attributable to returns received from the 
second wave, i.e., the postcard reminder when no replacement questionnaire 
was enclosed. Nearly identical returns were received from the first and third 
mailings (a fourth one was not used), suggesting that the effect was due 
entirely to the "greater retrievability" at the time of the postcard follow-up that 
encouraged the recipient to find and return the questionnaire. 

Other research has demonstrated that nearly comparable response rates can 
be obtained by substituting a telephone follow-up for the final certified 
mailing, when a telephone number is available (Dillman & Moore 1983, 
Dillman et al 1984). This finding is particularly important because of societal 
trends toward greater work force participation for all adults in a household and 
toward the existence of more single person households. When a certified letter 
is delivered and no one is at home to sign for it, then a note is left asking the 
respondent to come to the post office to pick it up. This procedure allows an 
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opportunity to identify ineligible respondents who had simply ignored other 
mailings, thus improving the researcher's ability to calculate response rates 
accurately. 

In addition, four experiments (Dillman [978, Dillman & Moore 1983) have 
consistently shown that a modest increase in response, from 2-4%, can be 
obtained by using stamped return envelcpes instead of business reply en- 
velopes. Except for these changes—in the substitution of a final telephone 
reminder for the certified mailing, the use of stamped return envelopes, and 
use of modern word processing equipment to print questionnaires and provide 
better graphical design on them—the TDM remains mostly unchanged. We 
still know relatively little about the importance of many of the originally 
prescribed details. It does appear that the advent of desk-top publishing has 
led many users of TDM procedures, inclucing the author, to switch to slightly 
larger questionnaire sizes (folded legal size paper), more use of color, and less 
attention to folding and inserting the quest onnaires in the prescribed manner. 

A particularly glaring shortcoming of tte TDM is that it ignores the use of 
financial incentives, whose importance in :mproving response rates according 
to past research appears second only to the use of follow-ups. Relatively few 
studies have used a full TDM procedure amd experimented with various types 
of financial incentives. An exception is a recently completed study by Roberts 
& Hite (1990) which showed that respondents to a TDM household survey 
who received a $1.00 check were significaatly more likely to respond, 72% to 
65%. A study by James & Bolstein (190) has also demonstrated positive 
effects for both $1.00 and $2.00 incentives on TDM surveys. Finally, a recent 
study by Johnson & McLaughlin (19901, using a 32-page questionnaire, 
obtained an 82.6% response rate with a $5.00 cash prepayment versus nearly 
identical questionnaire but lower response rates for a $10.00 check after 
completion (72.3%) and for no financial incentive (71.3%). However, the 
$10.00 post payment resulted in a significently lower item nonresponse, 5.3% 
versus 9% for other respondents. The use of material incentives in TDM 
surveys has been attempted but generally they have not had any discernable 
effect on response rates (Nederhof 1983t, Mullen et al 1987). 

A third limitation of the TDM may te the reliance on social exchange 
theory. This theory was developed to explain the nature of face-to-face 
interaction and the social structures that evolved from those interactions (Blau 
1964 and Homans 1961). Mail survey respondents and researchers do not, as 
a general rule, come into contact with one another. Although the com- 
munications efforts (as many as four of which originate from the researcher) 
can reasonably be viewed as attempts at irteraction, this application of social 
exchange theory goes somewhat beyond its traditional development. Other 
theoretical perspectives might provide tre basis for development of com- 
prehensive mail survey systems (e.g. dissonance theory) (Hackler & Bour- 
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gette 1973), self-perception theory (Hansen 1980), attitude-behavior and 
altruism theoretical frameworks (Heberlein & Baumgartner 1978), and a 
broad social-values perspective that places emphasis on respondent's con- 
cerns of privacy (Goyder & Leiper 1985). However, none of these theoretical 
orientations has, as yet, been utilized individually or in concert with others to 
develop a comprehensive survey system aimed at minimizing nonresponse 
error. 

A fourth limitation of the TDM is~that the same set of procedures is 
recommended for all populations, thus giving it a one-size-fits-all nature. This 
puts the socia] exchange approach that underlies the TDM somewhat at odds 
with the TDM procedures derived from it. It is reasonable to expect that 
different people value somewhat different returns for their actions to some- 
what different degrees, and that these differences would be reflected in the 
perceived balance among rewards and costs associated with completing a 
questionnaire. For example, the reduced size of print and questionnaire 
advocated in the TDM may produce a positive response among young people, 
whereas enlarged print and size of questionnaire might do the same for older 
people. Thus, customization of detailed procedures to particular populations 
might be expected to maximize response for different survey populations. 

This possibility is demonstrated by Teske et al (1983), who has used a 
modified version of the TDM for numerous crime, victimization, and abuse 
surveys in Texas. Their procedures involved a TDM-like booklet question- 
naire and five contacts, including a prenotice, first mailing of a questionnaire, 
a postcard reminder, second mailing of a questionnaire, and a final reminder 
postcard. Personalized letters and stamped return envelopes were also used. 
Eleven statewide general public surveys that utilized these procedures 
obtained response rates of 61-77%. 

Closely related to this one-method-fits-all limitation of the TDM is the lack 
of recognition of how different survey situations may require quite different 
survey procedures. For example, one-of the biggest uses of mail surveys in the 
United States is requests to complete television viewing diaries in order to 
estimate audiences in each of the nation's television markets. Typically these 
surveys ask large numbers of respondents to fill out their diaries for a specific 
week; an emphasis is placed on not doing it by recall after that week is over. A 
standard TDM approach with contacts spread out over seven weeks simply 
would not work or be useful. A procedure commonly used by television 
research companies to complete such surveys involves two or more pre- 
contacts, two more contacts by mail and/or telephone during'the diary week to 
remind people of the need to complete the questionnaire as they watch 
television, and'another notice to return the questionnaire at the end of the 
diary week. 

Another situation which a standard TDM procedure does not fit is that 
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when questionnaires can only be hand-delivered to respondents (e.g. airports, 
shopping centers, and museums where the questionnaire is handed to visitors 
who are asked to return it at a later time, or even the drop-off of question- 
naires to an area of a probability sample o7 households). Machlis & Dillman 
(1990) have developed a modified TDM procedure for use in such situations. 
They combine social exchange and foot-ia-the-door principles to develop a 
sequence of requests that involves first asking respondents to pull to the side 
of the road, second, asking if they woulc be willing to answer three short 
questions and complete a mail-back questionnaire after their visit to the park, 
and third, asking for a mailing address so that a “thank-you” can be sent to 
them. The resultant procedure produced response rates of 84-88% from four 
national parks versus 31-42% in three parks where the original procedure was 
simply handing out questionnaires to randcm samples of visitors as they paid 
the park entrance fee. 


Conclusions and Needed Research 


In a regression analysis of the response rate determinants of 500 different 
surveys, Goyder found that on average the response rate for mail surveys was 
only 7.5% lower than for face-to-face interview surveys included in the 
sample (1985). Heberlein & Baumgartner, in the previously discussed quan- 
titative assessment of mail survey resporses, found that the surveys they 
examined had obtained response rates o^ over 60%, and one fourth had 
response rates higher than 80%. DeLeetw (1990) in a meta analysis of 
method comparison studies found that meil survey obtained response rates 
only eight percentage points lower than face-to-face interviews. Juxtaposition 
of these findings with those from TDM stucies leads me to conclude that there 
is no longer any reason to accept low response rates to mail surveys. Pro- 
cedures are available to assure response rates of at least 50% and higher than 
that for most populations. Lower response rates can no longer be excused on 
the grounds of the inadequacy of the ava lable mail survey procedures. 
Further, the consistency among majo- findings from past assessment 
efforts, as shown in Table 1, suggests substantial agreement on which tech- 
niques influence response. Improvements m our knowledge of how to reduce 
response error in surveys will not be advenced much further bv the type of 
article that has dominated and continues to dominate mail research, i.e. 
experiments on one or two techniques app-ied to base procedures that obtain 
low response rates. It can be assumed that there is a theoretical maximum 
response asymptote, which we cannot hope to exceed except through the use 
of culturally unacceptable inducements. Although for some populations this 
response asymptote may be 100%, for others it may be only 80-85%. 
Application of current knowledge can be expected to take response to within 
10—20 percentage points of this theoretical maximum. The kind of technique- 
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oriented..response rate experiments needec in the future are ones that use a 
comprehensive system as a base procedure and then are designed so that small 
incremental effects, e.g. two or three percentage points, can be identified as 
the asymptote is approached. 

A second research need is to shift from a focus on techniques like those 
described in Table 1 to an emphasis on how parts of the survey process 
complement one another. There is a need sor an entirely new set of concepts 
to capture these survey dynamics. For exemple, observation of people who 
make repeated requests of-another -persan 1o:do, something, with which that 
person has not yet complied, suggests;:thet the requester tends to pose the 
question in different-ways.each;itime-the request is made, presumably in 
search of an appeal that works. This obs2rvation suggests that "variety of 
contact" might be tested as a way of improving response rates. A survey 
procedure could be designed which uses d.fferent although compatible sizes 
and/or color of stationery, quite different leiter content, sent by different types 
of mail in each contact. The questionnaire night even be redesigned to make 
it shorter for a final mailing. Another concept worthy.of testing:is the 
"escalation.and de-escalation of intensity" associated with a.sequence of 
contacts. One might use a first contact that is very "soft", for example, 
mentioning that a survey will be coming ard that the sender would appreciate 
the respondent taking a look at it, and then increasing the intensity of the 
request culminating in a special delivery or express mailing that includes a 
substantial financial incentive in it. Alternatively, one might initiate a survey 
process with a handwritten, very personalized mailing that includes a finan- 
cial incentive and provides a deadline date, sent by express. mail, but then 
followed up with softer, more voluntary tyes of messages. Thus, one might 
test the hypothesis of whether a declining or escalating intensitv or an even 
balance among all contacts is most effective. The importance of the "variety" 
and "escalation" hypotheses here is less that these particular concepts them- 
selves may bear fruit than to illustrate how we might move beyond a focus on 
techniques to testing processes of an entire contact procedure for reducing 
nonresponse-error. 

Finally, as we approach the asymptote of the highest reasonably obtainable 
response, there is a great need to test for reactive effects that result in less 
accurate data due to pressing too intensely, for higher response;rates. The 
dilemma has been demonstrated in Sweden by Christoffersen'(1987), who 
-found-an educational bias in mail questionnaires, even when very high 
response rates were obtained; based upon other work, however, he concludes 
that intensive persuasion greatly compounds the error of responses. A con- 
trasting perspective is provided by DeLeeuw & Hox, who found in a TDM 
experiment that the use of personalization without a certified letter resulted in 
more socially desirable answers than wher:the. certified final follow-up was 
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used (1988). This issue underscores the very close linkage between nonre- 
sponse errér and measurement error, the issue to which we now turn. 


REDUCING MEASUREMENT ERROR 


Traditionally, concerns over measurement error in mail surveys have focused 
on item nonresponse, failure to obtain adequate answers to open-ended 
questions, the effect of low educational ‘levels, the lack of success with 
complex skip patterns, and the ability of the researcher to control the sequence 
in which questions-were asked: These problem areas tended to be balanced 
against two appareiit“advaiitagés, i.e. that the absence of an interviewer 
lessens the likelihood both of socially desirable responses and of interviewer 
subversion (Dillman 1978). 

During the 1980s, however, renewed attention has been focused on 
measurement error in mail surveys for two reasons. First, new research had 
persuasively demonstrated measurement error concerns in telephone and 
face-to-face interviews that are somewhat worse than many researchers had 
expected (Schuman & Presser 1981). Second, increasing numbers of mixed 
mode surveys apparently are being done to reduce costs and to deal with 
increasing error problems in face-to-face and telephone surveys, e.g. falling 
response rates (Dillman & Tarnai 1988, Goodfellow et al 1988, Brambilla & 
McKinlay 1987). Thus, the important questions about measurement error are 
not only how accurate are data obtained by mail surveys, but whether the 
ansWers are the same as those obtained for telephone and face-to-face in- 
terviews. The latter issue is especially important because many questions have 
no absolute standard of accuracy, e.g. attitude questions that utilize vague 
quantifier answer choices such as "strongly agree" or "somewhat agree." Mail 
survey research in these issues has been expanding but many questions remain 
unanswered. 


Do Mode Differences Exist? 


It is reasonable to expect that certain differences may occur when questions 
are answered in mail versus interview surveys. Mail survey respondents 
depend upon a visual presentation of information for which reading skills are 
necessary. In contrast, telephone respondents depend entirely upon aural 
skills for hearing the questions and upon memory and verbal skills for 
responding. All of these skills can be utilized in face-to-face interviews, but in 
general very little reading is likely to be required. The physical presence of 
the interviewer in the face-to-face situation, and the necessity of interacting 
with an interviewer for telephone interviews, also provide a basis for expect- 
ing that respondents might provide different responses. 

Thus, it is encouraging that relatively few differences have been reported 


242 DILLMAN 


for personal attributes of respondents or t3eir households that could not be 
attributed to nonmeasurement error differences such as differential response 
rates or noncoverage issues. The data aso appear mostly equivalent for 
easy-to-recall behavioral characteristics. 

In one of the first major self-administered (mail retrieval was not used) 
versus telephone comparisons, Bishop et al (1988) found that the order effects 
previously identified by Schuman & Presser (1981) were less likely to occur 
in mail surveys. Ayidiya & McClendon 11990) also tested several effects 
reported in the Schuman & Presser (1931) study. Their work generally 
supports that of Bishop et al (1988) in suggssting that response effects in mail 
surveys are somewhat less than in telephone surveys. They found that for the 
most part recency effects (choosing the last response category) were reduced, 
but that primacy effects (choosing the first offered category) persisted. They 
found no evidence that question form effects produced by nonopinion filters, 
such as offering a middle alternative on agree or disagree questions, would be 
eliminated in a mail survey. A limitation o7 the Ayidiya & McClendon study 
is that they report results only for a mai: survey and, therefore, can only 
compare their mail survey results with the original interview responses from 
the Schuman & Presser study. 

Another type of difference frequently 5bserved in mode research com- 
parisons is a tendency for telephone and face-to-face respondents to select 
more extreme answers than do mail respondents when vaguely quantified 
scaler categories are used. Mail respondents, on the other hand, tend to 
distribute themselves across the full scale. Such an effect was first reported by 
Hochstim (1967), who found that the percentage of respondents saying their 
health was excellent (compared to good, fair, or poor) was 30% by mail 
versus 44% by telephone and 47% in the face-to-face interview. Mail respon- 
dents were also more likely to report having discussed medical problems with 
their husbands, an issue which, Hochstim reasoned, respondents might have 
been embarrassed to admit to the female interviewer. 

In a survey of drinking behavior, Mangione et al (1982) reported no 
difference between mail and face-to-face: or telephone surveys on 12 de- 
mographic items, but found self-administe-ed respondents more likely to use 
middle categories as opposed to extreme ones on questions that involved 
elements of socially desirability. Similarly, Dillman & Tarnai (in press) found 
that telephone respondents were more likely than mail respondents to say that 
they "never" (as opposed to frequently, sometimes, or occasionally) drove a 
car after they had been drinking alcoholic beverages. 

A dental health study by Zapka et al (1938) showed that telephone respon- 
dents reported brushing more frequently than did mail respondents. In a 
survey of former hospital patients, Walker & Restuccia (1984) found that 
telephone respondents reported greater satisfaction with their recent stay. A 
needs assessment of university psychological services revealed that 49 of 50 
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items were rated higher by telephone respondents (Talley et al 1983). In a 
comparison of mail, telephone, and face-to-face procedures, on a wide range 
of scales, DeLeeuw (1990) found small differences in psychometric reliabil- 
ity, with the mail method being most reliable and the telephone being least 
reliable. 

In a three-way comparison of methods, Dillman & Mason reported no 
significant differences on demographic items (1984). However, in a series of 
nine items that required respondents to assess specific community issues as 
“not a problem," “a small problem," “a medium problem," or “a serious 
problem" in their community, telephone respondents and face-to-face respon- 
dents were more likely to choose the extreme response choice (not a problem) 
for all nine items. A similar tendency was observed on a second set of nine 
items-in-a-series measuring neighborhood quality on a seven-point numerical 
scale, and on five individual items using various scaler responses (Dillman & 
Mason 1984). In a later study, the nine community problem items and 
neighborhood quality items were adapted to student living situations and 
readministered to samples of students by telephone, as well as by in-class 
self-administered questionnaires, with similar results. Whereas a field survey 
had resulted in a difference of 6.7 to 19.3 percentage points in use of the "not 
a problem" category between the mail and telephone, the student survey 
produced even greater differences of 12.6 to 38.1 percentage points (Tarnai & 
Dillman in press). 

In sum, there is substantial evidence that some differences exist between 
responses to certain questions asked in mail and telephone interview surveys. 
However, most of these studies have been empirically focused and have made 
only very limited attempts to provide theoretical explanations for the differ- 
ences. 
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Potential Explanations. for Mode Differences 


At least three possibilities have been posed as explanations for the telephone 
extremeness effect noted above. Perhaps the most frequently offered explana- 
tion is that respondents to self-administered questionnaires do not have to 
interact with an interviewer and are therefore less inclined to offer socially 
desirable answers. This explanation seems likely to account, at least in part, 
for the differences reported by Hochstim (1967), Mangione et al (1982), 
Tarnai & Dillman (in press), Zapka et al (1988) and Walker & Restuccia 
(1984). Each of these surveys dealt with items for which a clear direction of 
social desirability could be identified. However, the nature of these studies 
was such that other effects could not be ruled out. 

A second possible explanation is based upon the fact that telephone respon- 
dents must hear and respond to one question at a time, and mail respondents 
are able to look at full pages of questions. Thus, mail respondents are able to 
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consider answers to questions that both precede and follow particular items 
when deciding upon an answer. This "cortext effect" has been hypothesized 
to account for the extremeness effect in the:series of community and neighbor- 
hood needs assessment items. In an experiment that varied the ability of 
respondents to see the context of each question as it was being asked of them 
over the telephone, Dillman & Tarnai havz shown that context accounted for 
some, but by no means all, of the extrer»eness effect (in press). 

A third explanation concerns differences in pace and control of the answer- 
ing process in telephone versus self-administered surveys (Hippler & Schwarz 
1987), (Schwartz et al (in press). Telephone interviewers read each question 
at a pace set by them, and there is considerable pressure on respondents to 
listen intently and respond when asked. This pressure may lead to a top-of- 
the-head tendency, whereby the respondents base their judgment on the first 
piece of information that comes to mind. This perspective may explain why 
self-administered questionnaires were found by Bishop et al (1988) and 
Ayidiya & McClendon (1990) to be somewhat less subject to order effects. It 
may also explain the extremeness effect ander the assumption that the end 
categories are more likely to be remembered than are the intermediate catego- 
ries. 

Much remains to be learned about responses to self-administered and 
interview surveys, but the above research suggests that progress is being 
made. A theory of response-order effects appears to be emerging from 
research done in interview surveys. It is based on a theoretical perspective 
reported by Krosnick & Alwin (1987). Taey argued that when long lists of 
items were presented to respondents in "isual form, the respondents were 
more likely to choose responses that appeared early in the list; in oral form 
later items were more likely to be chosen. Israel and Taylor (in press) 
demonstrate this tendency for choosing the first item in a mail experiment, but 
no telephone comparison was made. Hippler, Schwarz, Noelle-Neumann, 
and their colleauges have extended this work in a series of papers, showing, 
for example, that the theory applies as well to dichotomous questions (Hippler 
et al 1989). They also show that the need te compute a judgment on the spot is 
a crucial prerequisite (Schwarz et al 1939). However, this theory cannot 
account for the extremeness effect described by Tarnai & Dillman (in press), 
which resulted in telephone respondents trequently picking the first offered 
category more often than did mail respondents. These differences may stem 
from the fact that Krosnick & Alwin (1987) examined the choosing of 
answers from a list of nominal alternatives whereas Tarnai & Dillman (in 
press) used ordinal response choices. This response task required the respon- 
dent to determine where his or her answer fit on an implied scale. 

Finally, in face-to-face surveys the ability of respondents accurately to 
recall visits to doctors can be improved by the use of recall techniques such as 
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reconstructing details of the visit and the use of benchmark dates important in 
people's.lives (Means et al 1989 and Jobe & Mingay 1989). Recently, these 
techniques were applied to mail survey respondents by asking about the extent 
to which they wore seatbelts on their three most recent trips in a car. The use 
of a four-question sequence to reconstruct details of each trip resulted in 
increases of 13-16% of the mail and 11-14% of the telephone respondents 
who reported wearing their seatbelts all of the time (Dillman & Tarnai, 
forthcoming). In the past, one of the perceived advantages of mail surveys 
was that people had time to carefully think through questions before answer- 
ing them. These data raise the possibility that this image of mail question- 
naires is more a myth than reality, and that cognitively designed series of 
questions may be just as important for use in mail surveys as in interview 
surveys. 


Future Research Needs 


The last few years have witnessed a very promising start in raising critical 
questions about measurement error in mai] surveys. Potential explanations of 
some of these effects have been offered but much remains to be done in order 
to develop a theory of response effects. It is critical that this work proceed and 
that it be done in concert with work on telephone and face-to-face interview 
surveys if the benefits of this research are to be realized for the development 
of mixed mode survey applications. 


REDUCING NONCOVERAGE ERROR 


When complete, up-to-date lists of populations are available, as is frequently 
the case because of the increased prevalence of computers for compiling and 
storing lists, the noncoverage error is not usually a problem. Studies of 
students, professional association members, employers, health insurance re- 
cipients, and many other groups are frequent examples. Yet, the problem of 
noncoverage error is perhaps the single greatest barrier to the expanded use of 
mail surveys, primarily because of the problem it presents for doing house- 
hold surveys of the general public. There are no up-to-date listings that 
provide complete coverage of households in the United States to the degree 
that can be accomplished by area probability samplings for face-to-face 
interviews and random digit dialing for telephone surveys. Telephone directo- 
ries present the most readily available source of names and addresses, but are 
usually out-of-date. In some regions of the country half or more of the 
households are unlisted, and this trend continues upward (Survey Sampling 
1990). Other lists, such as city directories and drivers license lists, are 
frequently unavailable and suffer from other problems of noncoverage. 
The solution to the noncoverage problem involves learning how to develop 


246 DILLMAN 


efficient sampling and other procedures for dropping off and retrieving ques- 
tionnaires from households and perhaps developing response inducement 
procedures that can rely on household listings without names. As interest in 
conducting mixed mode surveys develops, it seems likely that such pro- 
cedures may be developed because of tbe tremendous cost advantage to 
delivering questionnaires to people's households versus having interviewers 
return many times, in order to obtain acceptable response rates. This issue 
represents a very high priority for future research. 


THE FUTURE OF MAIL SURVEYS 


We live in an information age. Informa ion is increasingly important to 
decisionmaking in all units of US society and is generally being substituted 
for other resources in the production of gcods and services (Dillman 1989). 
The ability of sample surveys to find out information about populations by 
surveying only limited samples suggests tkat the use of all types of surveys, 
and especially mail, will increase. For example, low cost desktop publishing 
techniques create the capability for conducting quality mail surveys in even 
small businesses, agencies, and institutions. Another factor suggesting the 
increased use of mail surveys is the increased cost and nonresponse problems 
associated with telephone and face-to-face interviews, which have moved us 
toward mixed mode surveys. 

During the last three decades, tremendous progress has been made in 
improving response rates to mail surveys. Measurement error issues have 
been identified, and much research is now being directed toward the resolu- 
tion of these issues. In contrast, the lack of research on noncoverage issues 
represents a major barrier to advancememt in the uses of mail surveys, 
especially for the difficult-to-survey general public. 

In the past, research on improving mail surveys has mostly been conducted 
outside the major research centers of the United States, where attention has 
been focused in large part on face-to-face and telephone methods. The advent 
of interest in mixed mode surveys, in an efort to find the most cost-effective 
mix of survey procedures for conducting important societal surveys, suggests 
that the attention of these research centers may increasingly be focused on 
mail surveys in this context. Should such irterest develop, the benefits for the 
advancement of mail survey research will be considerable. 
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Abstract 


Not only sociologists but also economists have made important contributions 
to economic sociology. Which particular works by economists are relevant in 
this context is indicated with the help of Schumpeter's History of Economic 
Analysis, a work unique in that it traces not only the history of economic 
theory but also-that of economic sociology. Three main traditions appear in 
economic sociology, which are still fairly unexplored: the German tradition of 
Wirtschaftssoziologie (1890—1930), the French tradition of sociologie écono- 
mique (1890—1930), and the US tradition of “economy and society" (1950s). 
Since the 1970s a revival of interest in economic institutions has occurred 
especially in the United States, and a new economic sociology has come into 
being. Both economists and sociologists helped to create this new economic 
sociology. Economists have developed an approach known as New In- 
stitutional Economics. The main idea here is to explain the emergence and 
functioning of economic institutions with the help of microeconomics. 
Sociologists have developed an approach to economic sociology, sometimes 
referred to as the “new sociology of economic life." This approach attempts to 
analyze core economic problems, as opposed to the problems traditionally left 
over by the economists for the sociologists to solve. Today's sociologists have 
done important work, particularly on the role of networks in the economy, on 
the structure of economic organizations, and on the role of culture in eco- 
nomic life. Though some weaknesses hamper both new institutional eco- 
nomics and the new sociology of economic life, they have breathed new life 
and vigor into economic sociology. 
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INTRODUCTION 


Economic sociology is currently going through something of a revival. New 
and exciting studies of such topics as markets and firms are proliferating, and 
a number of sociology departments have introduced economic sociology into 
their curricula. This development makes it imperative to try to establish what 
has already been accomplished and what remains to be done in economic 
sociology. In this review essay we argue that there exist several major 
traditions of economic sociology, and that economic sociology has an un- 
commonly rich and diverse background to draw on. 

First of all, there are the three major attempts by sociologists themselves to 
launch economic sociology: (a) the German tradition of Wirtschaftssoziologie 
with its origin in the Historical School of Economics and with Weber's work 
as its magnificent centerpiece; (b) the French tradition of sociologie écono- 
mique, which has its roots in Saint-Simon's analysis of "industrial society" 
and which culminated in the work of the Durkheimian School in the early 
twentieth century; and (c) the attempt by a few sociologists in the United 
States in the 1950s (especially Talcott Parsons and Neil Smelser) to bring 
*economy and society" closer together. But even though economic sociology 
was mainly created by sociologists, it should be noted that economists also 
have made important contributions. According to Joseph Schumpeter, who 
was himself very interested in sociology, it would be a serious mistake to 
identify economic sociology exclusively with sociology—it is equally much a 
part of economics. As he once phrased it: "there is nothing surprising in the 
` habit of economists to invade the sociological field. A large part of their 
work—practically the whole of what they have to say on institutions and on 
the forces that shape economic behavior—inevitably overlaps the sociolog- 
ist’s preserves. In consequence, a no-man’s land or everyman’s land has 
developed that might conveniently be called economic sociology” (Schumpe- 
ter 1951:134; emphasis added). To give a full picture of the major traditions 
of economic sociology, we therefore include a section on the economists’ 
contribution. 

Before concluding this review essay, we shall, finally, take a look at a new 
kind of economic sociology in the process of constituting itself today. 
Schumpeter’s idea that economic sociology is a meeting place for economists 
and sociologists seems confirmed by recent developments, during which we 
have seen many interesting contributions from sociologists (such as Harrison 
White and Mark Granovetter) as well as from economists (such as Oliver 
Williamson and Gary Becker). What the new tradition in economic sociology 
will look like more precisely is, however, not yet clear. There exist, as we 
shall see, several competing perspectives, and what happens during the next 
few years will probably be decisive. 
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THE TRADITION OF ECONOMIC SOCIOLOGY AMONG 
THE ECONOMISTS 


To map out the contribution by the economists to economic sociology is no 
doubt a very complex and demanding task. Not only is there an enormous 
amount of material to go through, but there is also the added difficulty of first 
specifying what one means by "economic sociology" and then extracting 
exactly this material. And depending on how one defines "economic sociolo- 
gy", one will obviously get different results, something which makes the 
whole thing even more problematic. To undertake an enterprise of this type is 
also a research project in its own right and not suitable for a review article. 
We shall therefore take a short-cut and follow Schumpeter's account of 
economic sociology in his massive History of Economic Analysis. This work 
is unique for our purposes in that it explicitly set itself the task of tracing not 
only the history of economic theory (defined as the analysis of "economic 
mechanisms") but also that of economic sociology (defined as the analysis of 
“economic institutions" —Schumpeter 1954c:14—21). Unfortunately, Schum- 
peter's work does not include a section on the institutionalists, and we shall 
therefore have to complement the History of Economic Analysis on this point. 
We shall also say a few words about some other economists' contributions to 
economic sociology during the twentieth century. 

Schumpeter's book on the history of economic analysis is divided into four 
parts, all of which cover fairly long time periods. The first part begins, for 
example, with the Greeks and ends in 1790; and Schumpeter here discusses 
the contributions to economic sociology by such people as Aristotle, Cantil- 
lon, and Adam Smith. Aristotle, Schumpeter says, looked at slavery, prop- 
erty, and the state, while Cantillon tried to introduce the class struggle into 
economic analysis as well as to analyze the role that markets have played in 
the emergence of the modern city. Schumpeter was not particularly impressed 
by Adam Smith's analysis of economic institutions. As a matter of fact, he 
clearly preferred the works by the scholastics, especially their analysis of 
property. Adam Smith had in Schumpeter's opinion unduly exaggerated the 
importance of the division of labor. Book III of The Wealth of Nations (*Of 
the Different Progress of Opulence in Different Nations") could have become 
a starting point for a "historical sociology of economic life"; but for some 
reason Smith chose not to develop his ideas on this topic (Schumpeter 
1954c:186-87). 

History of Economic Analysis is particularly useful in that it contains an 
authoritative discussion of the contributions to economic sociology by the 
major European economists during the nineteenth and early twentieth century. 
During the classical period (1790-1870), according to Schumpeter, little 
progress was made in economic sociology since most economists took the 
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existing economic institutions for granted. This was especially true for the 
British economists, many of whom were strongly influenced by utilitarian- 
ism. And in Schumpeter's opinion, utilitarianism had had a very negative 
impact on economic sociology: “As regards economic sociology, utilitarian- 
ism can only be described as a complete failure since its rationalistic concep- 
tion of individual behavior and of social institutions was obviously and 
radically wrong" (Schumpeter 1954c:409). The classical economists, for 
example, just assumed that the state must not intervene in the economy. They 
also assumed, without any discussion whatsoever, that there should be free 
competition. But behind all their assumptions there existed a specific reality, 
which was not openly acknowledged. By "corporations" the classical econo- 
mists usually meant medium-sized corporations; to them, the owner and the 
manager were always the same person; and so on. 

The classical economists also ignored the impact by the nation and by 
social class on the economy. It is true, Schumpeter argued, that we can find a 
class concept in, for example, Ricardo's work. But Ricardo used class in a 
purely theoretical and nonsociological sense. Indeed, one of the great in- 
tellectual mistakes of the classics was precisely that they had failed to develop 
a sociological concept of class: "they did not work out a theory of social 
classes, insert it into their economic sociology, and then construct economic 
categories for use in economic analysis: this procedure would have required 
an awareness of the problems involved from which they were far removed. 
Instead they took a short cut: they simply turned, with little modification, the 
social groupings that are known to the popular mind into categories of 
economic analysis" (Schumpeter 1954c:552). 

It is true, Schumpeter continued, that John Stuart Mill had tried to analyze 
the role of a few economic institutions, such as property and inheritance (see 
Mill 1987:218—37, 889-96). Schumpeter also pointed to Mill's famous analy- 
sis of the role of tradition in price-setting (Mill 1987:242—48). In the last 
analysis, however, Mill always judged institutions in terms of whether they 
favored or impeded a certain ideal; and this prevented him from developing a 
sophisticated economic sociology. The only economist during this period who 
was also a good sociologist was Karl Marx, according to Schumpeter. Marx 
had made two great contributions to economic sociology: his concept of social 
class and his theory of how economic factors influence the social structure and 
the social attitudes (what Schumpeter called Marx's "Economic Interpretation 
of History"). Schumpeter's analysis of Marx on this point may be com- 
plemented by the section on “Marx the Sociologist” in Capitalism, Socialism 
and Democracy, which contains a much fuller discussion of Marx’s contribu- 
tion than does the History of Economic Analysis (Schumpeter 1976:9—20). 
Schumpeter was very enthusiastic about Marx’s economic sociology; he says 
at one point that “the so-called Economic Interpretation of History is doubt- 
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less one of the greatest individual achievements of sociology to this day" 
(Schumpeter 1976:10). Schumpeter, however, also warns that Marx had 
pushed economic theory and economic sociology so close together that they 
practically merged. And this was unacceptable to Schumpeter: Economic 
theory should deal with economic mechanisms and economic sociology with 
the institutional framework; and these two tasks should be kept separate. 

During the third period which Schumpeter covered in History of Economic 
Analysis, 1870-1914, several important discoveries were made. Jevons, 
Menger, and Walras invented marginal utility analysis, and their work was 
continued by such outstanding scholars as Pareto, Wicksell, and Marshall. In 
terms of economic sociology, however, these years were a disappointment. 
Sociology took great strides forward, but none of this research was used by 
the major economists who simply took over the institutional assumptions from 
the classical economists. The result was the following: “Nations remained 
amorphous agglomerations of individuals. Social classes were not living and 
fighting entities but were mere labels affixed to economic functions (or 
functional categories). Nor were the individuals themselves living and fight- 
ing beings: they continued to be mere clotheslines on which to hang pro- 
positions of economic logic. And, with improving vigor of presentation, these 
clotheslines stood out even more visibly than they had in the works of the 
preceding period" (Schumpeter 1954c:885-87). 

It was also during the 1870—1914 period that American institutionalism was 
born. Schumpeter had originally planned to include a section in his book on 
the three founders of American institutionalism: Thorstein Veblen, Wesley 
Clair Mitchell, and John R. Commons. He died, however, before he got 
around to writing this section; we shall therefore have to leave Schumpeter at 
this point of our overview (see, however, Schumpeter 1950). It is often noted 
that the three founders of American institutionalism were quite dissimilar, 
both in terms of their interests and in their style of analysis. This is indeed 
true: Mitchell was mainly concerned with finance and business cycles, Com- 
mons with law and labor, while Veblen wrote on the most diverse topics such 
as the leisure class, the theory of the firm, and Imperial Germany (Veblen 
.1899, 1904, 1915; Commons 1919-1935, 1924; Mitchell 1941; for Veblen, 
see also Dorfman 1934). Still, the institutionalists do have certain things in 
common which set them apart from neoclassical economists and which show 
their affinity to economic sociology. There is first of all their attempt to 
develop a different and much broader type of economic theory—a theory 
which was to include not only economics but also elements from the other 
social sciences, such as psychology, history, and sociology. Veblen, Com- 
mons, and Mitchell were also very interested in analyzing change over time, a 
topic which was impossible to handle with marginal utility theory in the early 
twentieth century, but which lent itself very nicely to the sociological type of 
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analysis. Their interest in institutions was to a large extent an outgrowth of 
their fascination with change as well as of their attempt to deal with other 
units of analysis than those that are commonly used in neoclassical theory. 

The institutionalists' critique of mainstream economic theory was basically 
that it was too narrow and individualistic, and unable to deal with empirical 
reality. Mitchell, for example, felt that economic theory was a hindrance to an 
effective understanding of business cycles. What was needed was a 
straightforward empirical approach, as exemplified by the work of the Nation- 
al Bureau of Economic Research (which Mitchell helped to found in 1920). 
Commons said that economic theory should be able to deal with "collective 
economic behavior," not only with individual economic behavior. Economics 
would, for example, do well to incorporate the concept of "transaction" from 
legal science and to view economic action as a "process." As Commons 
(1924:7) phrased it in The Legal Foundations of Capitalism: “While the 
economists start with a commodity or an individual's feeling towards it, the 
court starts with a transaction. Its ultimate unit of investigation is not an 
individual but two or more individuals—plaintiff and defendent. . . . The 
transaction is two or more wills giving, taking, persuading, coercing, de- 
frauding, commanding, obeying, competing, governing, in a world of scarci- 
ty, mechanism and rules of conduct." This concept of transaction is, no 
doubt, very close in spirit to economic sociology (it was also to inspire some 
of the analysis in Oliver Williamson's Markets and Hierarchies, 1975). The 
most biting critique of marginal utility theory came, however, not from 
Commons or Mitchell but from Veblen, who argued that the neoclassical type 
of economic theory failed to incorporate the latest developments in psycholo- 
gy as well as history. As a result, the economic actor was reduced to 
something of a caricature. In an oft cited quote by Veblen: "The hedonistic 
conception of man is that of a lightning calculator of pleasures and pains, who 
oscillates like a homogenous globule of desire of happiness under the impulse 
of stimuli that shift him about the area, but leave him intact" (Veblen 
1898:389). 

The emphasis on change in the work of the American institutionalists is 
particularly clear in Veblen's argument that economics must become an 
evolutionary science. In Veblen's "evolutionary economics," institutions are 
defined as "settled habits of thought common to the generality of men," and 
this means that they are historically formed (Veblen as cited in Gruchy 1947, 
p. 68). To Veblen, it was simply inconceivable that one could analyze 
economic behavior without taking history into account and the changes that 
societies display over time. Commons was basically of the same opinion, as 
his massive History of Labor in the United States testifies (Commons 1918— 
1935). And so was Mitchell. Indeed, in Business Cycles the past and the 
present are organically connected to one another— "prosperity breeds crisis,” 
to cite a famous line by Mitchell (1941:153). Mitchell's concern with change 
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is also clear from his work as the chairman of the committee which in 1934 
published the famous interdisciplinary volume Recent Social Trends (Pre- 
sident's Research Committee on Social Trends 1934). 

To trace the contributions by economists to economic sociology from, let 
us say, the mid-1930s and onwards is a complicated task. It must first of all be 
noted that there was practically no communication between economists and 
sociologists after the triumph of neoclassical theory over the Historical School 
in Europe and over institutional economics in the United States. This means 
that practically all “sociological” work, which was done by economists during 
the rest of the twentieth century, was produced without any contact whatsoev- 
er with the sociological tradition. At the risk of simplifying things too much, 
we can say that economists have followed two different routes in their 
attempts to tackle sociological problems during the years after 1930. There is 
first of all the small minority of economists who have continued to work in the 
tradition of the original type of institutionalism. They basically explain the 
emergence and workings of economic institutions through a very broad 
conception of economic theory, which also includes a sociological perspec- 
tive. This older type of institutionalism has been kept alive since the 1930s 
through the works of a small number of well-known economists such as John 
Kenneth Galbraith, Gunnar Myrdal and, Raoul Prebisch (see e.g. Galbraith 
1952, 1955, Myrdal 1957, 1968, Prebish 1962, Samuels 1988). Then there 
are those economists who have developed a novel type of institutionalism 
known as “new institutional economics.” These latter explain the emergence 
and workings of economic institutions through the principle of maximizing 
utility. 

The new type of institutionalism has had much more of a following in 
mainstream economics. It started to bloom in the 1970s but can in some sense 
trace its intellectual roots to the 1930s, especially to Ronald H. Coase’s 
classic article “The Nature of the Firm” (1937). Coase here makes an argu- 
ment that lays the foundation for transaction cost economics (including Oliver 
Williamson’s Markets and Hierarchies), namely that the decision to use a 
market or, alternatively, a firm, depends on the transaction costs involved. 
Though Coase does not explicitly say that non-economic institutions can also 
be explained through the principle of maximizing utility, it is clearly only a 
short step to this position—a step which, however, would not be taken until 
some time after World War II (see e.g. Coase 1960). Though we now leave 
new institutional economics, we return to it later. 


THE SOCIOLOGICAL TRADITIONS IN ECONOMIC 
SOCIOLOGY 


Unfortunately no sociological equivalent to Schumpeter’s History of Eco- 
nomic Analysis exists; and the history of economic sociology, as developed by 
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sociologists, is still very much virgin territory (but see Swedberg 1987). 
Three important attempts have been made in mainstream sociology to create a 
viable economic sociology: in Germany, France, and the United States. The 
attempts in Germany and France paralleled each other in time but were 
nonetheless independent of one another. They both started in the 1890s, 
culminated just before World War I, and died off in the 1930s. The attempt in 
the United States, on the other hand, began first in the 1950s and petered out 
in the 1960s. Each of these three traditions in economic sociology display a 
different profile. They are all fairly unexplored and, more importantly, they 
are all relatively unexploited. This is especially true for the richest of them all, 
the German tradition. 


The Tradition of German Economic Sociology (1890—1930) 


What characterizes the Wirtschaftssoziologie, which emerged around the turn 
of the century, is first of all that it was profoundly historical and comparative 
in nature. An attempt was also made in this type of economic sociology to 
show that the study of economics belongs to the “cultural sciences," which 
basically means that an interpretive approach had to be used. In terms of the 
objects it studied, German economic sociology focussed primarily on two 
topics: economic development and the role of the state in the economy. In 
both areas it left behind magnificent achievements, such as the works by 
Weber, Sombart, and Schumpeter. 

That German economic sociology was historical and comparative in nature 
primarily means that it drew much of its inspiration from historical research 
and that it made use of comparisons—comparisons between institutions, 
between nations, and between civilizations. The idea that economics is not a 
natural science but a "cultural science" and therefore must deal with culturally 
constructed meanings, came from philosophy, especially Dilthey. According 
to Weber (1975:151—52), it was an economist called Friedrich Gottl who first 
stressed that Verstehen must be systematically introduced into economics. To 
avoid any misunderstandings, it should be noted that what people like Weber 
and others had in mind was not so much subjective opinions about some 
aspect of economic life (as in, say, contemporary survey research), but 
culturally constructed meanings. These could take the form of institutions but 
could also be embodied in the "spirit" of a whole economic system. The two 
major topics that the German economic sociologists chose to focus on— 
economic development and the role of the state in the economy (e.g. Hoselitz 
1960, Bücher 1968)—were both much related to the problems surrounding 
the industrialization of Germany. 

German economic sociology owes much of its strength to the Historical 
School in Economics and to Marx’ masterful analysis in Capital (Marx 1967). 
The pioneeer in this whole development, however, was Friedrich List, who 
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introduced the notion of nation into economics in 1841 in The National 
System of Political Economy. Schumpeter (1954b:100) writes: “In this context 
List's contribution towards an economic sociology is of first importance, 
[especially—RS] his conception of the natural economy in the setting of its 
historical causes as the embodiment of historically unique circumstances." A 
few years later, in 1843, Wilhelm Roscher (1895:101) took the important step 
of announcing that economics must use "the historical method." "It will be 
seen," he wrote, “that this method aims at much the same result for political 
economy as the method of Savigny and Eichhorn has attained in ju- 
risprudence" (Roscher 1895:103). Roscher emphasized that a comparative 
approach must be used as part of the historical method. Schmoller (1952:375) 
has summarized. Roscher's methodology in the following way: "There is 
everywhere the same aim and the same method: a number of collective 
entities and their characteristics are compared and placed side by side. They 
are ordered in historical series. The attempt is made to identify the causes of 
institutions, which prevailed under parallel circumstances. This should lead to 
laws of historical development. and produce benchmarks for the appraisal of 
economic policies and of old and modern institutions." (For a.critique of 
Roscher’s methodology—especially his idea that it was possible to establish 
economic laws—-see Weber 1975). E 

The greatest achievements in historical economics cannot, however, be 
attributed to the so-called Older Historical School (whose three key members 
were Roscher, Knies, and Hildebrand). These achievements were instead 
produced by Gustav von Schmoller and his followers, who soon became 
known as the "younger historical school." In the 1880s Schmoller got 
embroiled in a very bitter fight with Carl Menger—the so-called Methoden- 
streit—which he lost and which has cast a dark shadow on Schmoller's 
accomplishments ever since (see e.g. Bostaph 1976, JITE 1988). As opposed 
to what is commonly thought, however, Schmoller was not hostile to eco- 
nomic theory and was exclusively interested in historical details. As 
Schumpeter (1926) has pointed out in one of his key essays, written to 
rehabilitate the founder of the Younger Historical School, much of what 
Schmoller wrote actually contains a healthy mixture of analytical generality 
and historical facts. Indeed, this mixture is also typical of Wirtschaftsso- 
ziologie; and, according to Schumpeter (1926:181), “Schmoller’s whole life- 
work was to a large extent devoted to the founding and development of . . . 
economic sociology." Schumpeter also stressed that no other economist had 
taken the task so seriously as Schmoller concretely to describe and analyze the 
economic process. Others may have proclaimed this to be their goal, but only 
Schmoller made a serious attempt actually to carry this program out. Schmol- 
ler gathered statistics and historical documents; he produced specialized 
monographs and he tried to synthesize all of his immense knowledge in a 
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general treatise, Grundriss der Allgemeinen Wirtschaftslehre (see especially 
Schmoller 1900; see also Hintze 1964). 

It should be noted that German economic sociology to a large extent 
emerged as an attempt to close the gap in the Methodenstreit between the 
historically and the theoretically oriented economists. The key idea, most 
forcefully articulated by Weber, was that economic sociology could serve as a 
bridge between these two positions by drawing simultaneously on both histor- 
ical research and on analytical theory. Between 1890 and 1930 this project 
attracted some of the most creative social scientists in Germany, such as Karl 
Bücher, Otto Hintze, Joseph A. Schumpeter, George Simmel, and Werner 
Sombart. From the very beginning, however, Max Weber was the towering 
figure: it was he who invented the field of Wirtschaftssozologie and it was he 
who made the most important contributions to it. 

In contemporary sociology relatively little attention has been paid to Web- 
er's economic sociology, apart from The Protestant Ethic and the question of 
the relationship between economic theory and Weber's sociology. The debate 
on The Protestant Ethic has usually centered on the question whether Weber 
meant that protestantism had "caused" capitalism or whether he had a more 
subtle interpretation in mind (for a review of the literature see e.g. Marshall 
1982). And the studies of the relationship between economic theory and 
Weber's sociology have very much been preoccupied with the influence of 
neoclassical theory and/or historical economics on Weber's work (e.g. Clarke 
1982:186—242; Hennis 1987; Schón 1987; Holton & Turner 1989; but see 
Osterhammel 1987). Little attention, however, has been directed at Weber's 
economic sociology per se (see, however, Parsons 1947, Collins 1986). This 
is a great pity because huge parts of Weber's work contain extremely valuable 
contributions to, economic sociology. Throughout his career Weber, for ex- 
ample, made studies of such topics as medieval trading corporations, the 
stock exchange, the economic history of Antiquity, the Historical School in 
Economics, German agriculture, the productivity of industrial workers, and 
the economic ethic of the major religions (see especially Weber 1975, 1976, 
1978, 1979, 1980, 1988a,b,c). 

To outline the main gist of these studies and their relevance for economic 
sociology cannot be done in this overview. What must be mentioned, howev- 
er, is Weber's attempt to lay a theoretical foundation for economic sociology. 
He did this during the last few years of his life, in the text which today is 
contained in chapter 2 of Economy and Society, "Sociological Categories of 
Economic Action" (Weber 1978:63—211). When Weber lectured on this topic 
in the late 1910s, however, the students found it far too abstract. Weber 
therefore decided to give a general course in economic history in 1919-1920 
in order to give some "flesh and blood" to his concepts in economic sociolo- 
gy. This course was later translated into English (by Frank Knight) as General 
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Economic History and represents, together with chapter 2 in Economy and 
Society, an indispensible guide to Weber's economic sociology (Weber 
1981). 

Chapter 2 in Economy and Society and in General Economic History 
complement one another nicely and should also be read together. The main 
theoretical categories are the same, even if they are presented in different 
ways: in a dry, systematic manner in Economy and Society and as part of a 
historical unfolding in General Economié History. Two key themes, present 
in both works, are that economic sociology must be interpretive, and that 
power plays a key role in the economy. In Economy and Society Weber 
(1978:64) stresses that "all *economic' processes and objects are characterized 
as such entirely by the meaning they have for human action." And in General 
Economic History Weber exemplifies this approach by observing: "There is 
great difference between the fact that a Chinese or Japanese peasant is hungry 
and knows the while that the Deity is unfavorable to him or the spirits are 
disturbed and consequently nature does not give rain or sunshine at the right 
time, and the fact that the social order itself may be held responsible for the 
crisis, even to the poorest laborer. In the first case, men turn to religion; in the 
second, the work of men is held at fault and the laboring man draws the 
conclusion that it must be changed" (Weber 1981:292). In Economy and 
Society Weber (1978:68) says that the notion of economic power is "in- 
dispensible" to economic sociology (the exact term Weber uses is Ver- 
fügungsgewalt or “powers of control and disposal"). According to Weber, a 
number of important economic phenomena—such as profitmaking, consumer 
behavior, and so on—can only be understood if the role of economic power is 
properly taken into account. His analysis also contains many brilliant socio- 
logical observations on economic behavior in different civilizations (mainly 
China, India, and Europe), on different types of economic ethics (e.g. 
"internal ethics" versus "external ethics"), and on different economic roles 
(e.g. "the alien trader" versus "the resident trader"). It should finally also be 
mentioned that both works contain a sophisticated attempt to develop a 
sociology of markets. 

Many of the themes that Weber touched on in his economic sociology were 
also discussed by other social scientists, who were either influenced by Weber 
or became interested in them on their own. The attempt to determine what had 
“caused” capitalism characterizes, for example, Werner Sombart's giant work 
Der moderne Kapitalismus (Sombart 1987). Sombart's work in economic 
sociology is marred by the author's eccentric scientific style but is also filled 
with flashes of great insight (see e.g. Hintze 1975). In Die drei Nationaló- 
konomien Sombart (1930), for example, made an interesting attempt to 
outline the methodology of “an interpretive economics" ("verstehende 
Nationalökonomie” ). The relationship of the state to the economy plays a key 
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role in German economic sociology and also in Otto Hintze's work, especially 
in his fine study of mercantilist policy in eighteenth century Prussia (Hintze 
1987). While Hintze had primarily been concerned with tbe role of the state in 
the construction of national markets, Schumpeter tried to analyze its budget 
and its involvement in various imperialist ventures (Schumpeter 1954a, 
1971). In Capitalism, Socialism and Democracy one can also find an attempt 
by Schumpeter—inspired more by Marx than by Weber—to sketch the 
structural contradictions of the capitalist system (Schumpeter 1976). Two 
other classics in economic sociology from these years are Karl Bücher's 
Industrial Evolution and Simmel's The Philosophy of Money (Bücher 1968, 
Simmel 1978). With its theory of stages and.engaging institutionalist account, 
Bücher's work was a great success, not only in Germany but also in the 
United States. Simmel’s work, finally, contains an important contribution to 
the sociology of money, especially through its pioneering analysis of the role 
of trust in the economy (see in this context also Schumpeter 1970). 


The Tradition of French Economic Sociology (1890—1930) 


There exist many similarities betwen French economic sociology and German 
economic sociology. Both emphasized the role of institutions in the economy, 
and both stressed that the method of economic sociology must be comparative 
and historical. Both also assigned a central place to the cultural-symbolic 
dimension: in German economic sociology, through the concept of Verstehen, 
and in French economic sociology, through Durkheim's notion of "collective 
representations." "Economic phenomena," to cite Marcel Mauss (1967:123), 
"are not only material phenomena; they are also collective representations 
which dictate the attitudes of the members of society towards material ob- 
jects.” But some important differences clearly exist between the two traditions 
as well. While theorists like Weber, Sombart, and Schumpeter spoke about 
"capitalism" and its struggles, the group around Durkheim preferred to talk 
about "industrial society" and its lack of cohesion. And while in Germany the 
sociologists tried to find a place for economic sociology alongside economic 
theory (or at least not in open opposition to the economists), the Durkheim- 
ians fought the economists and wanted to replace economic theory with 
economic sociology. 

These differences are to a large extent due to the fact that German economic 
sociology and French economic sociology come from different intellectual 
traditions. While the former grew out of the Historical School in Economics 
and Marxism, French economic sociology was primarily influenced by Saint- 
Simon and Comte. Saint-Simon (1966) was the first to analyze "industrial 
society;” indeed it was he who coined or-at least popularized this term. His 
portrayal of contemporary European society as in a painful transition from 
feudal society to industrial society was extremely influential. It inspired, for 


MAJOR TRADITIONS OF ECONOMIC SOCIOLOGY 263 


example, Comte’s famous “Jaw of three stages" and Durkheim's Division of 
Labor in Society with its analysis of modern society as anomic and without 
rules to regulate the economy (Durkheim 19642). To deal with the problem of 
industrial society, Comte argued, a general theory of society was needed as 
opposed to economic theory, which dealt with only one sector of society. His 
Cours de Philosophie Positive, a very influential work, contains a scathing 
attack on economic theory for being scholastic, ideological, and sterile 
(Comte 1869:193—204; see also Maudit 1928). This critique was later echoed 
by Durkheim, most prominently in Rules of Sociological Method but also in 
other writings (e.g. Durkheim 1887, 1908, 1964b, 1970; Aimard 1962). 
Frangois Simiand, whom Durkheim appointed to be in charge of the section 
on "sociologie économique" in Année Sociologique, also made many attacks 
on economic theory. "The mathematical economists," as he wrote in La 
Méhode Positive en Science Economique, “do not hesitate to call what they do 
for science, even before they have controlled if their hypothetical con- 
structions are supported by the facts; even if they are not supported by the 
facts; and even if they are contradicted by the facts. It is not too much to say 
that this constitutes a methodological scandal since we are dealing with a 
science which after all pretends to explain reality" (Simiand 1912:130). 

The major French economic sociologists consisted of Durkheim and his 
collaborators, first of all Mauss and Simiand but also Halbwachs and the 
Bourgin brothers (see e.g. Bourgin 1924, Halbwachs 1940). Durkheim him- 
self made a suggestive analysis of the origin of the respect for property 
(tracing it to fear inspired by religious taboos), but was otherwise not particu- 
larly interested in economic sociology (Durkheim 1983). Marcel Mauss' The 
Gift is a brilliant study, not only of gift-giving but of exchange in general and 
of the role that the economy plays in preindustrial societies (Mauss 1969). At 
one point Mauss started to study the origin of money, but he never completed 
this work (Mauss 1974; see Simiand 1934 for an elaboration of Mauss's ideas 
on this topic). Like Durkheim, Mauss believed that many modern economic 
institutions have their origin in religious behavior and that economic sociolo- 
gy therefore needs to be complemented by anthropological research. Scattered 
throughout Mauss' work one can also find the most interesting comments on a 
variety of economic topics. At one point Mauss, for example, suggests that 
money can be seen as a way of "adjusting" the process of production and 
consumption to one another; and at another, that the essence of the economy 
consists in "collective waiting" (Mauss 1967:132; 1974:117-18). 

Frangois Simiand is little read today but exerted, interestingly enough, an 
important influence on the Annales School through his insistence that social 
science should deal with the long waves of history rather than with single, 
spectacular events. Simiand (1932) made this point most forcefully in a giant 
statistical study of French salaries between 1790-1930, entitled Le Salaire, 
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l'Evolution Sociale et la Monnaie. Simiand's own brand of economic sociolo- 
gy—which he referred to as "the positive method in economics” (e.g. 
Schumpeter 1952, Besnard 1983, Cedronia 1987)—differed radically from 
the work of the Annales School, primarily through its insistence that all 
subjective elements should be excluded from the analysis. But as Marc Bloch 
(1934:30) has noted a propos Simiand: "to only see numbers as ‘objective’ 
would . . . restrict economic sociology in a fatal way." Indeed, some of the 
very best works on the economy, which have been produced by the members 
of the Annales School, have drawn more on Durkheim's notion of "collective 
representations" than on Simiand's "positive method" (e.g. see Duby 1974, 
Braudel 1985). 


The US Tradition of "Economy and Society" 


The development of economic sociology in the United States has been quite 
different from that in Germany and France. No vigorous economic sociology 
emerged around the turn of the century; and no major works on the economy 
were produced by the founding fathers of American sociology. Indeed, it was 
not until the 1950s that a serious attempt was made to launch economic 
sociology as a distinct research area. And even when this was done, a certain 
hesitation was involved: one rather spoke of “economy and society" than of 
"economic sociology" (e.g. Moore 1955, Parsons & Smelser 1956). Why 
economic sociology had such difficulty coming into being in the United States 
is not clear. One reason was probably the hostility (later replaced by in- 
difference) of the economists. In the 1890s the American economists, for 
example, made it clear to the sociologists that they should stay away from 
economic topics and exclusively focus on "social" topics, such as marriage, 
deviance, and poverty (e.g. Furner 1975). The alienation that many econo- 
mists felt toward sociology was further intensified in the 1940s when mathe- 
matical economics became popular, and the last of what was still “social” was 
squeezed out of economics. 

Several interesting contributions were nevertheless made to economic 
sociology during the pre-1950 period. Around the turn of the century there 
developed, for example, a lively debate between economists and sociologists 
about the relationship between the two sciences (e.g. Giddings 1888, 1895, 
Ward 1899). Of the early sociologists, it may also be noted, Charles Horton 
Cooley was interested in economic topics. Cooley got a PhD in economics in 
1894 and his thesis, The Theory of Transportation, is an imaginative contribu- 
tion not only to human ecology but also to economic sociology (Cooley 
1930b). What makes Cooley's analyses of economic topics particularly in- 
sightful is his insistence on the centrality of communication in the economic 
process. The contemporary sociologist has also much to learn from Cooley's 
analysis of such topics as the market and the process of evaluation (see Cooley 
1918, 1930a, 1930b). 
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In mentioning Cooley's analysis of transportation it should be added that a 
steady stream of work on transportation and communication was produced in 
the social ecology movement in the first half of the twentieth century. Some 
of this work was carried out by people connected to the Chicago School, such 
as Robert Park and W. F. Ogburn (see e.g. Park 1950, Ogburn 1946). On the 
whole, however, the Chicago School did not contribute very much to eco- 
nomic sociology (Swedberg 1987:54—57). There were basically two reasons 
for this: the Chicago sociologists were not very interested in economics, and 
they respected the social division of labor between economists and sociolo- 
gists, which assigned “economic” problems to the economists and “social” 
problems to the sociologists (for an exception, see e.g. Ogburn 1964:349— 
60). The Chicago sociologists did, however, make important contributions to 
two new subfields in sociology, which are of interest to economic sociology, 
namely the sociology of professions and industrial sociology. Everett C. 
Hughes practically created the former subfield on his own (see Hughes 1971 
as well as his imaginative analysis of the real estate business in Hughes 1979). 
Industrial sociology, on the other hand, has its origins in the studies by Elton 
Mayo and his Harvard associates of the Hawthorne Works of the Western 
Electric Company in the late 1920s and 1930s (see Roethlisberger & Dickson 
1939, Homans 1950). But even if it was Mayo's work that laid the foundation 
for much later work in industrial sociology on productivity and morale, it was 
the Chicago sociologists who added the technique of participant observation 
as well as the recognition that the relationship between workers and manage- 
ment is often conflictual (see e.g. Blumer 1947, Whyte et al 1955). 

Another person who made an interesting contribution to economic sociolo- 
gy during the pre-1950 period was Talcott Parsons. His doctoral dissertation 
from 1927 was Der Kapitalismus bei Sombart und Max Weber (Parsons 
1927). At the time when Parsons wrote his thesis, he was still an in- 
stitutionalist. Later, however, he became convinced that institutionalism 
represented a dead-end and that neoclassical theory was far superior. In a 
series of articles, which culminated in The Structure of Social Action, Parsons 
argued that while economists look at reality from one particular viewpoint 
(that of "alternative uses of scarce means to the satisfaction of wants"), 
sociologists look at it from another viewpoint (that of values or "ultimate 
common ends"——Parsons 1934:526-29). Parsons called this new approach 
*the analytical factor view," and it represents to a certain extent an early 
version of the economy-and-society perspective (see also e.g. Parsons 1932, 
1935). 

During the 1950s the debate about economy and society became much 
more lively in the United States; and if this were a book rather than a review 
article, it would have to include discussions of Karl Polanyi's work in 
economic anthropology, George Katona's work in economic psychology, and 
Herbert Simon's work in behavioral economics—as well as analyses of 
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modernization theory and industrial sociology in the 1950s (see e.g. Katona 
1951, Polanyi et al 1957, Simon 1957, Schneider 1957, Dalton 1959, Hose- 
litz 1960). A self-conscious economic sociology made its first appearance in 
the United States at this time. The first stirrings came in 1951 when Francis 
X. Sutton, a student of Parsons, and James Duesenberry, a young economist 
who had become interested in sociology through Hans Gerth at Wisconsin, 
started to teach a course at Harvard called "Sociological Analysis of Eco- 
nomic Behavior" (see e.g. Duesenberry 1949, 1950). Sutton soon galvanized 
some other economists into action, such as Carl Kaysen, James Tobin, and 
Seymour E. Harris. As a result of their cooperative efforts, The American 
Business Creed would be published in 1956 (see Sutton et al 1956). The basic 
thesis of this work is that the "creed" or ideology of American businessmen is 
largely a result of the strain in their social role (for a critique, see e.g. Geertz 
1973). Parsons' first contribution to this new field came in 1953 when he 
delivered the Marshall Lectures in Cambridge (see Parsons, forthcoming). 
The thesis of the book that eventually resulted from these lectures—Economy 
and Society, co-written with the young Neil Smelser—differed considerably 
from Parsons' earlier "analytical factor view" (Parsons & Smelser 1956). 
Economic theory, Parsons & Smelser now argued, constituted a special case 
of "the general theory of the social system," and the economy itself was seen 
as a "subsystem" of its own (for a critique of Parsons’ economic sociology, 
see Pearson 1957 and Gerschenkron 1962). 

Economy and Society was no immediate success, and Parsons seems more 
or less to have lost interest in economic sociology after its publication (for his 
later work on money as a "generalized medium," see e.g. Ganssmann 1988). 
Neil Smelser, however, continued for some years to work in the new field. In 
his thesis, Social Change in the Industrial Revolution, Smelser drew attention 
to the importance of change in the sociological analysis of the economy 
(Smelser 1959). In 1963 he also published the first textbook in economic 
sociology and in 1965 the first reader (see Smelser 1963, 1965). After that 
Smelser turned his attention to other topics. 


NEW ECONOMIC SOCIOLOGY 


During the 1960s the separation between economics and sociology in the 
United States reached its peak: sociologists were very little interested in 
economic topics and economists stayed away from social topics. This artifi- 
cial division of labor began to break down in the 1970s; and since that time, a 
tentative dialogue between economists and sociologists has started up (see 
e.g. Swedberg 1990). Exactly what the result of this dialogue will be is 
not clear. Some economists feel, for example, that certain parts of economics 
can be improved with the help of sociology, while others think that many 
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problems in sociology can be solved by using an economic approach (see e.g. 
Akerlof 1984, Olson 1965, 1982, Becker 1976). Also the sociologists have 
their differences. Such scholars as James Coleman and Michael Hechter agree 
with economists such as Gary Becker and Mancur Olson that a rational choice 
perspective is needed in sociology (e.g. Hechter 1987, Coleman 1990). This 
is a proposition that, for example, Amitai Etzioni has criticized very severely 
(Etzioni 1988). Some sociologists—perhaps a majority—however, are stimu- 
lated by the prospect of a mixture of bóth economic and sociological per- 
spectives (see e.g. Winship & Rosen 1988, Fligstein & Dauber 1989). This 
is, for example, true for the situation in today's labor market sociology and 
stratification theory (see e.g. Berg 1981, Farkas & England 1988). 

Of all that is happening in the border area between economics and sociolo- 
£y, two approaches are of particular interest to economic sociology and can be 
said together to constitute a new economic sociology. These two approaches 
are sometimes referred to as "new institutional economics" and "new sociolo- 
gy of economic life"; and a few words shall be said about each of them. New 
institutional economics has basically been developed by economists, although 
a few sociologists have also expressed their interest. The key idea is that 
microeconomics can be used to explain social behavior and social institutions. 
In principle, this approach is considerably broader than economic sociology, 
but in practice it often deals with this field as defined by Schumpeter 
(economic sociology being the analysis of "institutions that are relevant to 
economic behavior, such as government, property, inheritance, contract, and 
so on" —Schumpeter 1954c:21). It must, however, immediately be added that 
new institutional economics has little if anything in common with American 
institutionalism as developed by Veblen, Commons, and Mitchell. As an 
intellectual movement, new institutional economics has its roots in the 1950s, 
although it did not really come into its own until the 1970s. Between 1950 and 
1970 it was tried out on various legal, political and social institutions. In the 
1970s, however, some economists also began to propose provocative new 
theories of organizational behavior with the help of such concepts as 
asymmetric information, transaction costs, and agent-principal (see e.g. Wil- 
liamson 1975, Barney & Ouchi 1986; for a game theoretical approach, see 
Schotter 1981). Each of these theories explains the emergence and/or struc- 
ture of organizations somewhat differently. In agency theory the emphasis is 
on how the principal can control and direct the agent, while in transaction cost 
economics, it is on how to minimize transaction costs. Usually, however, all 
of these approaches agree that efficiency is central to the choice of one 
organizational form rather than another. This sharply distinguishes new in- 
stitutional economics from the new sociology of economic life, which claims 
that the notion of efficiency cannot be used to explain the existing institutions, 
in the economy or elsewhere (e.g. Granovetter 1985, Oberschall & Leifer 
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1986, Bradach & Eccles 1989, for some self-doubt among the economists, 
see e.g. Furubotn & Richter 1989). 

Sociologists were considerably slower than economists in responding to the 
challenge of a new division of labor between economics and sociology. While 
the term "new institutional economics" became popular in 1975 (through 
Oliver E. Williamson's Markets and Hierarchies), it was not until 1985 that 
the term “new sociology of economic life" was first used (by Mark Granovet- 
ter at a luncheon roundtable discussion at the ASA). In 1985 a key article in 
the new economic sociology was also published: Mark Granovetter’s “Eco- 
nomic Action and Social Structure: The Problem of Embeddedness." True, it 
was Harrison White who had given the initial impulse to revive economic 
sociology through a series of pioneering studies of markets in the late 1970s 
(see White 1981; see also Barber 1977, Stinchcombe 1983). Still, Granovet- 
ter's article was immediately recognized as a trendsetter by economists as 
well as sociologists. For one thing, the article contained a sophisticated and 
elegant argument for the use of networks in the analysis of the economy. 
“Networks of social relations," we read, “penetrate irregularly and in different 
degrees . . . different sectors of economic life" (Granovetter 1985:491). The 
networks approach can help us to understand the role of trust in the economy 
and how economic institutions work in reality, as opposed to in economic 
theory. Secondly, Granovetter's article contained a very knowledgable and 
sharp critique of new institutional economics. Granovetter basically argued 
that the economists’ idea that the existing economic institutions have come 
into being because they represent the most efficient solution, is simplistic and 
wrong: “the main thrust of the “new institutional economists" is to deflect the 
analysis of institutions from sociological, historical, and legal argumentation 
and show instead that they arise as the efficient solution to economic pro- 
blems. This mission and the pervasive functionalism it implies discourage the 
detailed analysis of social structure that I argue is the key to understanding 
how existing institutions arrived at their present state" (Granovetter 
1985:505). And, third, Granovetter's article strengthened confidence that 
sociologists could solve a number of problems that by tradition only the 
economists had done work on. As he phrased it in an interview from about the 
same time: "I think that economic sociologists can go in and do wonderful 
things today . . . I think that the place to look is the place that sociologists 
have typically avoided looking, namely at all those kinds.of arguments and all 
those kind of subject matters that have been looked at almost exclusively up 
till now by economists" (Granovetter 1987:19). 

Since the early 1980s American sociologists have looked at a number of 
economic institutions, such as insurance, finance, transfer pricing, the organi- 
zation of salespeople, and so on (see e.g. Abolafia 1984, Eccles 1985, 
Heimer 1985, Baker 1987, Biggart 1989). Interesting work has also been 
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done on economics as a discourse and on the economic situation in socialist 
countries (Nee & Stark 1989, Block 1990). If one were to summarize the 
major intellectual innovations in economic sociology during the 1980s one 
could say that they have been centered around three major themes or topics: 
(a) the role of -networks in the economy (especially in markets), (b) the 
structure of different economic organizations, and (c) the role of culture in 
economic life. Interesting contributions have no doubt also been made to 
areas that partially overlap with economic sociology.but which have their own 
distinct traditions. This is particularly true for labor market sociology, which 
today is in a very creative stage, and stratification theory, which is in- 
creasingly turning toward economic insight and approaches (see e.g. the 
literature cited in Kalleberg & Sgrensen 1979, Gagliani 1987, Farkas & 
England 1988, Althauser 1989). 

The network approach became popular at Harvard in the 1970s through the 
teaching of Harrison White, who was also very interested in economic topics 
(see e.g. White 1970). White’s students, however, have been very in- 
dependent and used the network approach in their own distinct ways. 
Granovetter, for example, has analyzed the role that networks of information 
play when people get jobs (Granovetter 1974); Michael Schwartz has looked 
at interlocks between banks and corporations (e.g. Mintz & Schwartz 1985); 
and Robert Eccles has studied buyer-seller interactions between and within 
firms (e.g. Eccles 1981; see also Eccles & Crane 1988). Ronald Burt (who is 
a student of James Coleman, but influenced by Harrison White as well) has 
produced several studies of markets and is currently involved in developing a: 
general theory of competition (Burt 1983). The latter work is centered around 
the idea that certain individuals ("tertius gaudens") exploit "structural holes” 
in networks (Burt, forthcoming). Other important networks studies include 
Donald Palmer's work on "broken ties" and Wayne Baker's analysis of buyers 
and sellers in a national securities market (Palmer 1983, Baker 1984). 

Some of the sociologists who have studied networks have also been in- 
terested in the structure of economic organizations. This is, for example, true 
for Robert Eccles (1981) who has developed the notion of “quasi-firm” (a 
network type of relation with a high degree of stability) and Walter Powell 
(1985) who has drawn attention to a similar phenomenon, called "hybrid 
organizational arrangements." But other sources of inspiration exist besides 
network analysis; quite a few studies in economic sociology have, for ex- 
ample, emerged as a reaction to Williamson's transaction cost-theory and 
Chandler's work on the multidivisional firm (e.g. Fligstein 1985, 1990, 
Stinchcombe & Heimer 1985, Bradach & Eccles 1989). There also exist 
excellent works in industrial sociology and organization theory to draw on in 
this context (see especially Hirsch 1972, Stinchcombe 1959, 1974). 

A few sociologists have also raised the question of the role of culture in 
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economic life. This has been done most forcefully by Viviana Zelizer in her 
studies of what she calls “the historical interplay between human values and 
the market" (Zelizer 1983:ix; for a more programmatic statement on eco- 
nomics and culture, see Zelizer 1988). As of today Zelizer has studied three 
topics from this perspective: life insurance, the social value of children, and 
special types of money (Zelizer 1983, 1985, 1989). Through this focus on 
human values and the market, coupled with an interesting use of qualitative 
and historical material, Zelizer has added a new and exciting dimension to the 
economic sociology of the 1980s. How far one can go with a cultural 
perspective in economic sociology is, however, not clear; and arguments for 
as well as against relying mainly on a cultural explanation can already be 
found (see e.g. Hamilton & Biggart 1988, DiMaggio 1990). That economic 
sociologists need to take culture into account, and that economic anthropolo- 
gy has much to teach on this account is, however, obvious (for an overview of 
recent economic anthropology, see Orlove 1986). 

Most observers would probably agree that the new sociology of economic 
life has breathed vigor and life into a topic that was leading a rather bleak 
existence in the 1960s and the 1970s. A number of important studies—on new 
topics and with the help of new methods—were produced in the 1980s. Yet, 
some weaknesses also appear in this new form of economic sociology. For 
one thing, sociologists are still somewhat loath to take on traditional eco- 
nomic topics, and as a result we today lack a sophisticated sociology of 
money and of markets. Second, far too little comparative work has been done 
(see, however, Hamilton & Biggart 1988, Stinchcombe 1983). Third, it is 
unclear how to assimilate the insights from such independent intellectual 
traditions as world systems theory, institutional economics, feminism, and 
neo-marxism (see e.g. the literature cited in Wallerstein 1974—1989, Chirot & 
Hall 1982, Swedberg 1987:65-104, Hodgson 1989). And last but not least, 
little attempt has been made to mine the riches of economic history, from the 
superb German works from the nineteenth century to the studies by the 
Annales School of later date. Some of these weaknesses are no doubt due to 
the limitations inherent in the conventional view that economic sociologists 
shall only deal with peripheral aspects of the economy and stay in their own 
corner of academia. Hopefully, the time will soon come when sociologists 
directly challenge this doctrine, attack economic topics straight on, and open 
up a dialogue not only with economists but also with economic historians. 

Finally, it should be emphasized that in the last part of this article, which 
has been devoted to the most recent developments in economic sociology, we 
have focussed exclusively on the situation in the United States. There are 
some good reasons for this, especially that the two major types of new 
economic sociology have developed most forcefully here. Still, it is also true 
that much interesting work has been produced in other parts of the world. 
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European economists who are interested in economic institutions have, for 
example, for a long time found a platform in the Journal of Institutional and 
Theoretical Economics (JITE), which is published in West Germany. And if 
we turn to sociologists, it is clear that many interesting European and Third 
World sociologists have been active in International Sociological Associa- 
tion's Economy and Society—section (for a sample of this committee's work, 
see e.g. Makler, Martinelli and Smelser 1982, Martinelli & Smelser 1990). It 
is also true that some of the most creative sociologists in continental Europe 
have been interested in the intersection of economy and society during the 
1980s (see e.g. Boltanski 1982, Boltanski & Thévenot 1987, Luhmann 1982, 
Bourdieu 1984, 1990). Finally, there is currently a very strong interest in 
economic sociology in Eastern Europe and the Soviet Union (see e.g. Lengyel 
forthcoming). All in all, we can therefore conclude this overview by saying 
that economic sociology has definitely come alive again and that its future 


looks bright. 
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Department of Sociology, University of California at Los Angeles. 
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tions by a set of core studies and categorize them with respect to data 
collection, measurement, analytical models, research problems, main hypoth- 
eses, and substantive results. We go on to discuss a number of new topics and 
approaches that have gained prominence in the research agenda in the last 
decade. We conclude that the field has progressed considerably with respect 
to data collection and measurement; that shifts across generations with respect 
to data analytic and modelling strategies do not unambiguously represent 
advances; and that with respect to problem development and theory formula- 
tion the field has become excessively narrow. 


INTRODUCTION 


The study of the transmission of socioeconomic advantage from generation to 
generation is one of the core problems in sociology. From the turn of the 
century, empirical material has been collected on this topic (Perrin 1904). 
From the outset, cross-national and cross-temporal comparisons have had a 
central role, since such comparisons provide the only way to determine 
whether, to what extent, and in what ways the intergenerational transmission 
of advantage is dependent upon other aspects of social organization, and what 
its consequences are. 

The history of intergenerational stratification research is commonly divided 
into three generations (Featherman et al 1974): a first (post-war) generation of 
broad social stratification studies using relatively simple statistical tech- 
niques, and in which occupational mobility figured as only one issue among 
many; a second generation dominated by path models of educational and 
occupational status attainment; and a third generation dominated by loglinear 
models of occupational mobility. The three generations differ most sub- 
stantially with respect to (a) methods of data collection, (b) measurement 
procedures, and (c) methods of data analysis. Development has been more 
gradual with respect to (d) the definition of research problems and (e) the 
specification of major hypotheses. These five dimensions will be the lines 
along which we identify significant developments. 

We are well aware that the three generations are not distinct with respect to 
all five dimensions, nor are they very clearly separated in time. Nevertheless, 
it remains instructive to review the history of this field by characterizing each 
generation by a core of exemplary studies and by considering the successive 
generations in developmental perspective. We are relatively brief in our 
discussion of the first and the second generations, since they have been dealt 
with elsewhere (Hazelrigg 1974, Mayer 1979, Matras 1980, Featherman 
1981, Simkus 1981, Campbell 1983, Kerckhoff 1984), and more detailed in 
our review of the third generation and subsequent developments. Our review 
of the third generation covers part of the same ground as Kurz & Miller 
(1987), but with different conclusions. 
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Our topic, intergenerational stratification research (or “social mobility” as 
understood by the first generation), includes both bivariate accounts of the 
transfer of status positions from parents to their offspring and multivariate 
accounts of the same processes, where, minimally, educational achievement 
is studied as an intervening variable. We also include some discussion of the 
consequences of social mobility. We exclude those issues in stratification 
research that do not have immediate intergenerational aspects, such as income 
attainment and worklife mobility (Kurz & Müller 1987, Kalleberg 1988). 
Unfortunately, we also have to exclude from review the intergenerational 
transfer of material possessions (other than through occupational inheritance), 
not because it does not occur but because this topic has scarcely been dealt 
with in the literature (Cheal 1983). 


THE FIRST GENERATION 


Although Sorokin’s (1959 [1927]) Social Mobility is generally acknowledged 
as the starting point of (comparative) social stratification and mobility re- 
search in modern sociology (Heath 1981), only after the Second World War 
did systematic national studies begin to appear. A monograph by Glass (1954) 
on 1949 data for England and Wales was the impetus for the establishment of 
the Research Committee on Social Stratification and Social Mobility of the 
International Sociological Association, which since its founding has been a 
major locus of scientific exchange, data sharing, and international collabora- 
tion. At Glass’s instigation, a group of researchers from 12 countries decided 
in 1950 to collect data using a common framework (Rokkan 1951). This 
common framework included, among other elements, periodic national social 
stratification and mobility surveys in each of the countries, to investigate the 
determinants of social mobility and its consequences for “class identification 
and class antagonism” (Svalastoga 1959:22), and the creation of an occupa- 
tional prestige scale in each country as a basis for measuring intergenerational 
relationships. These plans were realized in some, but not all, of the countries. 
Svalastoga’s (1959) monograph on Denmark based on his 1953 survey re- 
mains the best known example, but similar plans were carried through in 1955 
in Japan (JSSRC 1956, 1958) and in 1954 in the Netherlands (van Tulder 
1962). Monographs following the main lines of the agenda were written on 
1954 Puerto Rican data (Tumin & Feldman 1961) and 1954 Swedish data 
(Carlsson 1958). The first generation research gained a comparative thrust 
through the work of Lipset & Zetterberg (1956), Lipset & Bendix (1959), 
and, in particular, Miller (1960). Lipset and Zetterberg compiled a set of 
fourteen 3*3 and 2*2 intergenerational mobility tables for 10 countries and 
concentrated only on manual/nonmanual mobility; the Miller analysis in- 
cluded 20 tables of varying size and breadth of coverage for 17 countries and 
investigated more detailed types of social mobility (e.g. elite mobility). 
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Research in the style of the first generation continued well after the main 
focus shifted to second generation research. Several researchers have com- 
piled collections of published mobility tables and analyzed them with methods 
more or less similar to those utilized by Lipset & Bendix and by Miller. The 
Miller collection of tables was extended and reanalyzed by several researchers 
(Marsh 1963, Fox & Miller 1965, Svalastoga 1965, Lenski 1966, Cutright 
1968, Jones 1969, McClendon 1980, Raftery 1985). An entirely new collec: 
tion of tables from studies conducted subsequent to 1960 was created by 
Hazelrigg (Hazelrigg 1974, Hazelrigg & Garnier 1976). Some of the same 
data were employed by Tyree et al (1979), who analyzed 24 2*2 tables, and 
Grusky & Hauser (1984), who analyzed 16 3*3 tables. However, whereas the 
last two articles employ some of the data and data collection methods of the 
first generation, they used third generation analytic methods. 

Although the common framework for the first generation studies included 
national occupational prestige inquiries as a basis for determining occupation- 
al status, in the end most published tables were not based on prestige scale 
scores. Instead, each researcher produced an ad hoc occupational classifica- 
tion. As a result, comparability across studies could only be obtained by 
collapsing the original occupational classifications into three highly aggre- 
gated categories: farm, manual, and nonmanual occupations; moreover, in 
some studies only a manual/nonmanual distinction was made. Collapsing into 
these two or three category schemes proved for many years to be the only 
means of obtaining comparability between published mobility tables from 
different countries. But in some cases comparability could not be achieved 
even in this way. Interestingly, the tables most often used to illustrate new 
mobility models—those for England & Wales and Denmark—cannot be 
collapsed into nonmanual, manual, and farm occupations and are therefore 
mostly excluded from later comparative studies.! 

Methodologically, much of the analysis in the first generation involved 
little more than the inspection of inflow and outflow percentages (Lipset & 
Zetterberg 1956, Miller 1960). However, some researchers recognized that 
observed mobility rates are a function of the marginal distributions and 
therefore cannot be used for comparative analyses. Several proposals were put 
forward to distinguish observed mobility rates from mobility chances net of 
differences in marginal distributions. The renowned "mobility ratio" was 
more or less independently arrived at by Glass (1954), Goldhamer & Rogoff 
(Rogoff 1979 [1953]), and Carlsson (1958), but it turned out to be inadequate 
to accomplish the separation of net mobility chances from the marginal 
distributions (Tyree 1973, Hauser 1978). 


"The reason for this, ironically, was that Glass and Svalastoga did use prestige for occupation- 
al scaling (as envisioned in the original ISA plan). 
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The basic comparative question of this generation was to what extent and in 
what ways countries differ in their mobility patterns. The best remembered 
conclusion is that of Lipset & Zetterberg (in Lipset & Bendix 1959; see also 
Lipset & Zetterberg 1956) that “the overall pattern of social mobility appears 
to be much the same in the industrial societies of various Western countries," 
which was offered in reaction to the prevailing assumption that the United 
States, as a "new" nation, would exhibit more intergenerational mobility than 
other western industrial nations. But the conclusion has not withstood early 
(Miller 1960) and later (Jones 1969, Hazelrigg 1974) reanalyses. 

A second important hypothesis was that mobility rates tend to be higher in 
industrialized societies than in nonindustrialized societies (Fox & Miller 
1965, Lenski 1966:410—17). Fox & Miller, Lenski, and several other re- 
searchers as well, found a positive relationship between indicators of eco- 
nomic development and indicators of social mobility (Marsh 1963, Cutright 
1968, Hazelrigg 1974), but their substantive conclusions have been contested 
by Goldthorpe (1985). 

A third concern of the first generation researchers was the effect of political 
structure on the extent of intergenerational mobility. Fox & Miller (1965) 
claimed to find a relation between the degree of political stability and the 
amount of mobility. Connor (1979) argued that state socialist regimes pro- 
mote social mobility and found some support for this proposition in an 
analysis of intergenerational mobility rates in Eastern European countries. 

Interestingly, there were many ancillary research questions in this genera- 
tion, but only two were addressed comparatively. One was the consequences 
of mobility for voting behavior. Lipset & Bendix's (1959) five country 
comparison claimed to find clear evidence of a mobility effect. Some subse- 
quent comparative studies also have dealt with this issue (Barber 1970, 
Abramson 1973), but this topic migrated from stratification research to 
political science and has received little subsequent attention in either disci- 
pline. The other was whether occupational prestige hierarchies in different 
countries are similar; the tentative answer of Inkeles & Rossi (1956), later 
confirmed rigorously by Treiman (1977), was that they are. 

Many other ancillary research questions were posed in one country or 
another but received little comparative attention, e.g. the extent of assortative 
mating by social origins and by education (Hall 1954), the effect of social 
mobility on fertility (Berent 1954), and the effect of social status on life style 
(Svalastoga et al 1956, Svalastoga 1959). Finally, many researchers were 
aware of the pivotal role of educational attainment in the intergenerational 
transmission of advantage (Glass 1954, Carlsson 1958, Tumin & Feldman 
1961); but, given the limited statistical models available then, they were not 
able to answer the crucial question: how much (im)mobility is mediated 
through education. 
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THE SECOND GENERATION 


The inception of the second generation of social mobility research was 
prompted by three related innovations connected with the name of O. D. 
Duncan. First, Blau & Duncan's (1967) US study (OCG D set new standards 
for data collection. An important innovation was the coding of occupations 
into the categories of the US Census three-digit occupational classification 
scheme. This created the possibility of detailed comparative analysis. Second, 
Duncan (1961) introduced a new scale for occupational status to be used with 
continuous data analysis techniques, his renowned SEI. It measured the status 
of each occupational category by the average education and income of 
incumbents of that occupation, thus tapping the major resources of individuals 
in the process of stratification. Third, and most important, the introduction of 
indirect effects (path) models into sociology (Duncan & Hodge 1963, Duncan 
1966b) led to the formulation of the Blau & Duncan (1967:Ch. 5) status 
attainment model, which made it possible to assess the relative importance of 
education and family background for status attainment. The model also 
included respondent's first occupation, thus allowing the assessment of occu- 
pational career mobility and creating the possibility of assessing historical 
trends via cohort analysis. An obvious extension of this approach was to 
measure occupational status at several points in the career and to estimate 
career chain models (Blau & Duncan 1967:184, Featherman 1971, 1973, 
Kelley 1973a,b). 

Unlike Glass's example, Duncan's research was never explicitly adopted as 
the agenda of the ISA Research Committee on Stratification. Nevertheless, 
the reaction to Duncan's work at the comparative level was swift and exten- 
sive. Comparisons of status attainment models in two or three countries were 
soon published by Machonin (1970), Jones (1971), and a number of other 
researchers (see Treiman & Ganzeboom 1990). National stratification and 
mobility surveys similar to or even broader in scope than the OCG-I study 
were conducted in Australia in 1965 (Broom & Jones 1969, 1976), Japan in 
1965 (Odaka & Nishihira 1966), and Czechoslovakia in 1967 (Machonin 
1969, Safar 1971), but the major emulation of Blau & Duncan's work in other 
countries was in the early 1970s, simultaneously with the 1973 OCG replica- 
tion directed by Featherman and Hauser (see Featherman et al 1974 and 
Broom & McDonnell 1974). National social stratification and mobility sur- 
veys were conducted in at least thirteen countries: in 1970, France (Thelot 
1982); in 1972, England and Wales (Goldthorpe 1987) and Poland (Andorka 
& Zagorski 1980); in 1973, Australia (Jones & Davis 1986), Canada (Boyd et 
al 1985), Hungary (Andorka 1983), Ireland and Northern Ireland (Hout & 
Jackson 1986, Hout 1989), and the United States (Featherman & Hauser 
1978); in 1974, Israel (Kraus & Hodge 1990) and Scotland (Payne 1987); and 
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in 1975, Italy (Ammassari 1978) and Japan (Tominaga 1979). All these 
studies were designed as stratification and mobility studies; they all contain 
detailed occupational codes, and, with a few exceptions,” unit record data 
files were prepared for public distribution. 

For measurement, the studies of this generation relied upon continuous 
scales. The gradual accumulation of occupational prestige scales finally 
resulted in a large-scale comparative study by Treiman (1977). The national 
prestige measures turned out to be highly comparable, and the Standard 
International Occupational Prestige Scale that integrates them has gradually 
become accepted as a valid measure of occupational prestige for comparative 
analysis. However, at the same time Featherman & Hauser (1976) showed 
that prestige measures underestimate the true degree of intergenerational 
transmission of occupational status and cast doubt on the usefulness of 
prestige as a measure of occupational status for the study of social mobility. 
The crucial difference between prestige and socioeconomic status is the 
position of farmers. Whereas farmers enjoy about average prestige around the 
world, they tend to be near the bottom of socioeconomic status scales such as 
Duncan’s SEI. Since the sons of farmers who leave farming tend to be 
concentrated in low status (and low prestige) unskilled or semiskilled jobs, 
SEI scales show more intergenerational association than do prestige scales. 
SEI scales were constructed for a number of countries: e.g. Canada (Blishen 
1967) and Australia (Broom et al 1977), but no international counterpart of 
Duncan’s SEI has yet been published (but see Ganzeboom et al 1989a). 

The research questions of this second generation were quite different from 
those in the first generation. The Blau-Duncan model reformulated the old 
question of how much intergenerational occupational mobility there is in a 
country into the new ones of how the (direct) influence of father’s occupation 
on son’s occupation? compares with that of other background factors, es- 
pecially education, and how much it is mediated by the status of the son’s first 
job. For the United States in 1962, the total correlation between son’s current 
occupation and father’s occupation was .405. This total correlation can be 
decomposed into an indirect effect via education of .227 (57%) and an effect 
net of education (direct or through the first job) of .178 (43%). The de- 
composition of the effect net of education reveals that .063 (16% of the total) 
arises from the effect of the status of father’s occupation on the status of the 
son’s first job, and the status of the first job on that of the current job, whereas 
.115 (28%) arises from the direct effect of father’s occupational status on the 
status of the current occupation. Another meaningful parameter in this 


?The Canadian data are accessible but, because of concerns of Satistics Canada regarding 
confidentiality, the public use files do not contain detailed occupational codes. The French data 
are sold at a prohibitively high price. The Italian data have yet to enter the public domain. 
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framework is the ratio of the effect of education on current occupation to the 
effect of father's occupation on current occupation (omitting consideration of 
the first job). For the Blau-Duncan model it is 2.9:1, thus warranting the 
conclusion that in mid-century United States, achievement was more impor- 
tant than ascription in determining occupational status. The answer to the 
question of the extent to which educational attainment promotes social mobil- 
ity thus turned out to be compound: Respondent's occupational status is more 
related to education than to father's occupation, and most of the effect of 
education is independent of social origins, so the main role of education is to 
promote social mobility; but at the same time a majority of what social 
reproduction there is is transmitted through education, so education is also the 
main vehicle of social reproduction. 

The main hypothesis of Blau & Duncan was similar to one of the major 
hypotheses of the first generation: industrialization promotes achievement and 
reduces ascription (Parsons 1940, Kerr et al 1960, Lenski 1966). However, 
whereas researchers of the first generation thought that such a shift implied an 
increase in the overall rate of intergenerational mobility, Blau & Duncan 
(1967:429) offered a more refined hypothesis: as societies industrialize, the 
importance of achievement processes, i.e. the influence of respondent's 
education relative to that of parental characteristics, increases, and the im- 
portance of ascriptive processes, i.e. the influence of family background, 
decreases. They sought to test this hypothesis for the United States via cohort 
comparisons, by studying the determinants of education and the status of the 
first job, and found no clear trend over time. 

A number of hypotheses about how status attainment varies across societies 
were proposed by Treiman (1970). Among the most important of these was 
the conjecture that in more economically developed countries the direct effect 
of parental status on respondent's education and the status of the current 
occupation is weaker than in less developed countries. These ideas were not 
tested on a large scale, but a number of limited studies contrasting the United 
States with less developed countries were carried out (see Treiman & Ganze- 
boom 1990:110—-15 for a summary). Apart from the obvious weakness of 
testing hypotheses about specific societal variations on the basis of two or at 
most a handful of cases, the use of the United States as the contrast con- 
founded the effect of economic development with the (possible) distinctive- 
ness of the United States. 

Just as the effect of industrialization on mobility reemerged as an issue in 
the second generation, so did the effect of political structure. Heath (1981) 
suggested, and provided support for, the hypothesis that in communist and 


For reasons that will be elaborated below, most of the analysis to date of intergenerational 
status transmission has been limited to men. 
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social-democratic nations the effect of father's occupation on son's occupa- 
tion tends to be smaller and the effect of son's education on son's occupation 
to be larger than in politically conservative countries. In an analysis of the 
1952 Bolivian revolution, Kelley & Klein (1981) argued that in the short run 
(but not in the long run) revolutions promote social mobility. 

Duncan's work stimulated a number of ancillary research questions. Dun- 
can et al (1972) sought to broaden the scope of status attainment research by 
introducing cognitive ability and motivational variables. This effort was taken 
up in a long-term investigation of a cohort of high school graduates by Sewell 
and his associates, which generated a major literature on the social psycholog- 
ical aspects of the status attainment process (Sewell & Hauser 1975). Other 
researchers explored such factors as the effect of place of residence, ethnicity, 
religion, and career contingencies such as age at marriage and child bearing, 
on status attainment. The first generation questions regarding homogamy 
(Blau & Duncan 1967:Ch 10) and fertility (Duncan 1966a, Blau & Duncan 
196:Ch 11) also were addressed in the second generation, but with more 
sophisticated models. However, few of these ancillary issues were pursued on 
a comparative basis. 

An important advance in the second generation, which flowed directly from 
the introduction of simultaneous structural equations as the modelling tool 
(Jóreskog 1970), was the assessment of and correction for measurement 
unreliability. Bielby et al (1977) were the first to show how to incorporate 
error corrections in intergenerational occupational attainment models. But, 
again, there was little echo of these efforts in the comparative literature. 

Finally, one can categorize the second generation by the issues that were 
not addressed. One of the most conspicuous of these was how social mobility 
affects political formations, which had been of interest to the first generation. 
This problem would have been tractable in the second generation. This is so 
because political scientists (who have been much more successful than stu- 
dents of social mobility in establishing an internationally standard research 
design (e.g. Barnes et al 1979) have routinely included father's occupation as 
well as father's party affiliation in their surveys, a fact that has passed largely 
unnoticed by social mobility researchers. Less conspicuous, but equally 
important, was the virtual disappearance of items on life style and other 
consequences of social status from the data collected in the second generation. 
Actually, it might be more accurate to say that concern with life-style issues 
has hibernated in Eastern Europe, where a very strong interest developed in 
culturally defined social inequality (Wesolowski & Slomczynski 1968, 
Machonin 1969, 1970, Robert 1984). Via cultural reproduction theory (Bour- 
dieu & Passeron 1977) this topic has returned to the center stage of in- 
tergenerational stratification research (DiMaggio 1982, DiMaggio & Mohr 
1985, Robinson & Garnier 1985, De Graaf 1988), but has not yet become 
comparative. 


286 GANZEBOOM, TREIMAN & ULTEE 


It has been widely acknowledged that the status attainment model revolu- 
tionized social mobility research and, for that matter, sociology at large (Kurz 
& Müller 1987). However, it is also fair to say that the second generation of 
intergenerational stratification research has never fulfilled its comparative 
promise. Status attainment models now exist for many nations, not only in 
Europe and North-America, but also for a number of countries or regions in 
Latin America, East Asia, and sub-Saharan Africa (Treiman & Ganzeboom 
1990). However, most comparative analyses of status attainment deal with 
only two to four countries and therefore have little discriminatory power. 
Comparing the coefficients from published status attainment models is not a 
viable strategy because such models tend to differ both in the variables 
included and in the way they are measured. For a long time the research effort 
needed to reanalyze data at the unit level turned out to be prohibitive, and only 
recently have really large-scale comparisons of status attainment models 
begun to appear. Treiman & Yip (1989) estimate an elementary occupational 
status attainment model for 21 countries, starting with unit record data and 
standardizing education (years of school completed) and occupational status 
(Treiman's international occupational prestige scale). On average, father's 
and son's occupations correlate .345 (median), with a minimum of .226 in 
Italy and a maximum of .547 in India (additional information provided by 
Treiman). The median percentage of direct transmission was 45%, with a 
high of 93% in India and a low of 13% in Sweden. The median ratio of the 
effect of respondent's education to that of father's occupation was 3.1, with a 
high of 16.0 for Sweden and a low of .3 for India. Treiman & Yip included 
explicitly measured contextual variables in their analysis, and they show 
rather strong relationships between industrialization and the components of 
the status attainment model. 


THE THIRD GENERATION 


The third generation of stratification research got underway before the second 
generation really finished. This is literally true, since in the beginning of the 
1970s, members of the ISA Research Committee, working within the status 
attainment approach, had already agreed to conduct the above described series 
of (loosely coordinated) national surveys (Broom & McDonnell 1974, 
Featherman et al 1974). These surveys were conducted between 1972 and 
1976. However, by the time the data became available for comparative 
analysis, the new exemplary studies of Hauser (Hauser et al 1975a,b, Hauser 
1978, Featherman & Hauser 1978), and Goldthorpe (Goldthorpe & Llewellyn 
1977a,b, Goldthorpe et al 1978, Goldthorpe 1987) had prompted a massive 
shift in the dominant thrust of stratification research. Multivariate linear 
regression models were replaced by a variety of loglinear models, among 
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which the levels (or "topological") model introduced by Hauser (1978) is 
dominant. The methodological advantages of loglinear models over con- 
tinuous data models such as correlation and regression are believed by their 
advocates to be twofold. First, loglinear models provide a technically ade- 
quate way to distinguish absolute mobility from relative mobility chances 
(social fluidity). Second, such models make it possible to treat a bivariate 
association as a multidimensional pattern (Hout 1984) and, in particular, to 
model the diagonal (which represents class immobility) separately from the 
off-diagonal cells. 

Some analysts have turned to loglinear modelling for mundane method- 
ological reasons, but others have made a substantive case for doing so 
(Goldthorpe 1987). Class theorists in the field of social mobility argue that 
social classes are intrinsically discrete and unordered, and hence that ex- 
change relationships between social classes are not properly modelled using 
"hierarchical" measures and the linear models of the second generation of 
stratification research. Loglinear levels models make it possible to deal with 
pairwise and asymmetric exchange relations between social classes, without 
any assumptions regarding the ordering of the classes. 

The ünofficial program of the third generation of social mobility research in 
the late 1970s and the 1980s became more or less institutionalized in the 
CASMIN project (Comparative Analysis of Social Mobility in Industrial 
Nations), directed by Goldthorpe and Müller (1982), with Erikson as their 
main collaborator. The CASMIN project extended the comparison of in- 
tergenerational mobility patterns to 13 industrial nations (7*7 tables), both 
Western and East-European, and constituted the first attempt to conduct 
cross-national intergenerational mobility research by recoding and standardiz- 
ing detailed high quality unit data from national social mobility surveys in a 
substantial number of countries. 

The substantive results from the CASMIN project are several. First, a 
common system of broad class categories (the EGP categories, after Erikson, 
Goldthorpe & Portocarero 1979) came into use. These categories have been 
widely accepted as a standard classification of occupational classes for com- 
parative research." Similar efforts are underway for the measurement of 
educational status, an even more difficult problem (Lüttinger & Kónig 1988). 

Second, the model of class mobility proposed by the CASMIN researchers 
has conclusively established the existence of multidimensionality and dis- 
continuities in intergenerational occupational mobility patterns. Erikson & 


“Their relationship to the International Standard Classification of Occupations (ILO 1969) was 
documented in Ganzeboom et al (1989c). The International Standard Classification of Occupa- 
tions (ISCO), developed by the International Labor Office, is used widely—either exactly or with 
minor modifications—by central statistical agencies and census bureaus in many countries. 
Hence, using it greatly facilitates data comparability and, thereby, comparative research. 
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Goldthorpe (19872) have arrived at the CASMIN “core model”, which con- 
sists of a loglinear model with superimposed levels that are associated with 
substantively interpretable parts of the mobility pattern: inheritance effects, 
hierarchical effects, sectoral effects, and (somewhat less interpretable) (dis) 
affinity effects. 

Third, the CASMIN researchers claim support for a number of specific 
substantive conclusions (Erikson & Goldthorpe 1985, 1987a,b): (a) In- 
heritance effects and sectoral effects are more important than hierarchical 
effects in explaining relative mobility patterns. This directly challenges the 
assumption made in the status attainment literature (but see below). (b) 
Relative intergenerational occupational mobility patterns do not differ much 
between countries. The core model of relative mobility ("common social 
fluidity") is applicable to all industrial nations; specific deviations from this 
model occur in various countries, but these deviations are to be accounted for 
in terms of peculiarities of each country's history. However, relative mobility 
is slightly higher in Sweden (a nation with a long-term social-democratic 
tradition) and the United States (until recently the economically most ad- 
vanced country) than in the other countries they analyzed. 

However, other researchers have arrived at quite different conclusions, 
sometimes using the same data. Wong (1990) compared data from the 1970s 
from the United States, England, Hungary, Japan, Poland, and Brazil, and 
concluded that the intergenerational occupational mobility pattern between 
these countries varies strongly. Ganzeboom et al (1989c) compared 149 6*6 
tables from 35 countries and found large between-country variation in the 
parameters of mobility tables, thus strongly challenging the thesis of common 
social fluidity. Other evidence suggests that in at least some countries relative 
intergenerational mobility chances have increased over time: Goldthorpe & 
Portocarero (1981) on France; Erikson (1976, 1983) on Sweden; Hout (1988) 
on the United States; Ganzeboom et al (1989b) on Hungary; Ganzeboom & 
De Graaf (1984) and Luijkx & Ganzeboom (1989) on the Netherlands. Also, 
Ganzeboom et al (1989c) found significantly increasing relative mobility 
chances for 16 of the 18 countries for which they had replicate data and 
estimated a decline in the relevant parameters of between one and two per cent 
per year. 

The loglinear levels models utilized in the CASMIN research have some 
drawbacks, which may account for the contradictory conclusions. Their most 
unattractive property is that they do not yield an over-all characterization of 
the mobility regime. For example, the CASMIN core model disaggregates the 
association in the relative mobility distribution into eight different parameters 
(Erikson & Goldthorpe 1987a). From a statistical point of view, the dis- 
criminatory power of levels models is limited, since they spread the differ- 
ences in association between tables among multiple degrees of freedom, 
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which is a crucial weakness for comparative research (Hauser 1984a, Yama- 
guchi 1987). The scaled association models introduced by Goodman (1979) 
and first applied in comparative mobility research by Hauser (1984a) have 
much greater discriminatory power. The Goodman-Hauser model estimates 
an a posteriori mobility dimension upon which classes are ordered with 
respect to the relative mobility chances between them; also, separate parame- 
ters can be estimated for some or all of the diagonal densities (which represent 
the probability of remaining immobile). 

To our knowledge all existing analyses of intergenerational occupational 
mobility patterns that derive scale scores a posteriori from the extent of 
exchange between occupational categories, whether in the form of the Good- 
man-Hauser model (Hauser 1984a, Ganzeboom et al 1989c), multi- 
dimensional scaling (Laumann & Guttman 1966, Blau & Duncan 1967), or 
canonical correlation analysis (Klatzky & Hodge 1971, Domanski & Sawins- 
ki 1986, 1987), have found a single or very strong dimension that resembles 
the rank order of occupational classes with respect to socioeconomic status, 
with professionals, managers and owners of large businesses at one extreme 
and agricultural workers at the other. If anything is constant across time and 
societies, it seems to be this simple pattern of relationships between occupa- 
tional classes with respect to their relative mobility chances: the main determi- 
nant of the probability of exchange between occupational categories is their 
similarity with respect to socioeconomic status. The explanation for this 
finding is that the socioeconomic status of occupations is a good proxy for the 
myriad of resources that promote the intergenerational transmission of advan- 
tage, and also for the extent of advantage gained. It is to be noted that three of 
the four components of the CASMIN core model (hierarchy, sector, (dis-) 
affinity) are related to socioeconomic status, which leaves inheritance effects 
as the main form of discreteness. f 

The third generation of social mobility research has considerably narrowed 
the scope of the field. Through loglinear modelling we have learned more 
about what is, in fact, only a bivariate distribution. Earlier multivariate 
research questions, as well as most of the ancillary research questions of the 
first and second generations, have been dropped from the agenda of the third 
generation (although studies addressing these questions continue to appear in 
the literature). There is some work by third generation researchers on the 
relation between father’s occupational class, the class of the respondent’s first 
job, and respondent’s present class (the question of intragenerational mobil- 
ity) (Hope 1984, Erikson & Goldthorpe 1987c), on the role of education in the 
transmission of class from father to son (Yamaguchi 1983, Hout 1988, Müller 
et al 1988, Semyonov & Roberts 1989), and on homogamy (Ultee & Luijkx 
1990), but comparative research in these areas is underdeveloped. Oddly, 
though, the third generation, with its predilection for class concepts, 
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has ignored even the research problem that gave rise to the class approach, the 
question of class mobility and political fcrmation (Kurz & Müller 1987). 
Finally, the issue of data reliability has siriply been forgotten. One is well 
justified in the assertion that many of these questions have not been resolved 
by earlier research and should therefore be considered as prematurely aborted. 

On the theoretical level, not many new iceas have emerged. Interestingly, 
in their pre-CASMIN work, Erikson & Goldthorpe (Erikson et al 1979) were 
concerned with the influence of political mstitutions on mobility regimes, 
arguing that France, England, and Sweden are similar in economic develop- 
ment but different in political climate. Moreover, most of their post-hoc 
arguments on differences between the CASMIN countries (Erikson & Gold- 
thorpe 1987b) deal with factors other than economic and industrial develop- 
ment: legally institutionalized relationships between education and the labor 
market (the German Lehrstelle [apprenticeskip] system), which are claimed to 
create a particularly wide gap between marual and nonmanual classes; wel- 
fare transfers and low income inequality im Sweden, which are claimed to 
promote social mobility across the board and the socialist abolition of 
proprietorship in Hungary and Poland, which is claimed to have decreased the 
degree of occupational class inheritance. Thare seems to be some opportunity 
for an institutional theory of social mobility, but this literature has not 
produced a concise or coherent formulatiom of it. 


NEW DEVELOPMENTS 


Subsequent to (and to some extent alongside) the shift of emphasis to log- 
linear modelling, there have been a number of additional developments in 
intergenerational stratification research. In addition, there have been a num- 
ber of suggestions for new analytic strategies that merit greater attention than 
they have thus far received. We here highlight developments and suggestions 
in six areas. 


New Data Collection 


Although the core surveys analyzed in the first three generations were ex- 
plicitly designed as stratification studies, datasets collected for other or 
general purposes increasingly have been utilized for comparative analysis. In 
an increasingly large number of countries annual or bi-annual general social 
surveys are conducted, e.g. the US General Social Survey. Most of these 
surveys originate within the field of attitude and value research. Researchers 
from (by now) 12 countries with an interest in these subjects have joined 
forces in the International Social Survey P-ogram (Becker et al 1990) and 
have agreed to include a common module on a specific topic in each annual 
survey, plus a common set of social, demcgraphic, and family background 
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characteristics directly pertinent to students of intergenerational mobility. 
Second, surveys in two other fields have turned out to be rich sources of 
intergenerational stratification data, namely demographic surveys and—even 
more so—election surveys (Niemi et al 1985). In both fields, international 
coordination has been much more successful than in stratification research. 

The abundance of existing data from which elementary intergenerational 
stratification models can be estimated is certainly desirable, since exploitation 
of such data will sharply increase the statistical power of comparative analy- 
ses, not only by adding new countries to the pool of evidence but even more 
so by adding over-time replicates. This will not only lead to better historical 
comparisons, but also to a reduction of random variance in within-country 
patterns. Reducing error variance and increasing statistical power should be of 
great concern to comparative stratification analysts (as well as to other 
comparative analysts), since true variability in mobility rates across time and 
space is probably modest (but substantial in the long run). Future analyses of 
more extensive bodies of data might well lead to the conclusion that sub- 
stantively important differences simply have been swamped by random error 
in previous analyses. 


Event History Models 


Meanwhile, the field of stratification is responding once again to a method- 
ological innovation: event history models. Although earlier stratification 
surveys sometimes included detailed educational and occupational histories, 
until the introduction of event history analysis into sociology (Tuma & 
Hannan 1984, Blossfeld et al 1989) not many analysts had found a proper way 
to analyze such data (although the Blau-Duncan model provided an 
elementary way to study careers by including both the first and current 
occupations). The collection of information on each job in the career and the 
treatment of job spells or time periods as the units of analysis makes possible 
the simultaneous analysis of the career structure and historical effects (Bloss- 
feld 1986). It also provides a solution to one of the difficulties common to 
mobility tables and status attainment models: The moves implied by these 
models are not located in historical time and hence it is impossible to relate 
them precisely to historical circumstances (Sórensen 1986). 

To date, not many comparative analyses of work histories have appeared; 
Allmendinger (1989) is a notable exception. In addition, the authors of event 
history analyses have tended to ignore the central questions about in- 
tergenerational stratification patterns. Although nothing forbids the inclusion 
of family background in career analyses (this would amount to a detailed 
analysis of the point in the respondent's career and the historic time when the 
influence of fathers occurs), not many articles have reported on this (an 
exception is Sórensen & Blossfeld 1989). A final peculiarity of this approach 
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is that three of the still small number of datasets that contain life history 
data—the US, Norwegian, and German stadies (Allmendinger 1988)—are 
restricted to a limited set of birth cohorts. 


Multiple Indicator and Sibling Models 


An interesting dead-end in stratification research in the last decade appears to 
have been the multiple indicator approach. I is obvious that stratification data 
suffer from unreliability and invalidity, as do other data. Reporting errors, 
coding errors, recoding errors, and incomparability of measures between 
studies each take their toll. This leads to oserestimation of direct effects in 
status attainment models, relative to indirect effects (Kelley 1973a). The 
standard reaction has been to refine measures or throw away unreliable data, 
instead of repeating the measurement via rrultiple indicator designs. Never- 
theless, several interesting and viable multiple indicator analyses have 
appeared in the literature, but none of them has inspired much replication. 
Bielby et al (1977) and Hauser et al (1983) : mplemented a multiple measure- 
ment design using repeated measurements from interviews conducted at 
different points in time. Others have reinterviewed part of their sample or 
have gone back to marriage records in order to assess the reliability of the 
measurement of parent's status (Broom et al 1978, Massagli & Hauser 1983); 
an obvious strategy for using such data woul be to apply known or estimated 
reliability coefficients to status attainment models. Another noteworthy pro- 
posal is to overcome incomparabilities across countries by estimating multiple 
indicator models with both indigenous and internationally comparable in- 
dicators (Krymkowski 1988). 

Strongly related to the multiple indicator approach and even less well 
developed is the use of sibling models (Teubman 1977, Hauser & Mossel 
1985, Hauser & Sewell 1986), which include two (or more) descendants of 
the same parental family in intergeneratioral stratification models. Sibling 
models provide an unbiased estimate of the total parental influence on off- 
spring's outcomes whereas the standard des gns suffer from omitted variable 
bias (Hauser 1984b, Hauser & Mossel 1985). Sibling. models are therefore 
capable of directly addressing the central question of intergenerational mobil- 
ity research: the extent and determinants of social reproduction. (In- 
terestingly, Duncan anticipated this develooment because he included var- 
iables for respondent's oldest brother in path models to obtain better estimates 
of parental effects—Blau & Duncan 19€7:Ch 9). If information is in- 
dependently collected from at least two sibl:ngs, multiple indicator measure- 
ment comes as an additional advantage of -he sibling approach, since each 
person interviewed can provide an indeperdent measurement of all the in- 
formation in the model (Hauser & Wong 1929). To our knowledge, there has 
as yet been no comparative work on this topic, although some data are 
available. 
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Multivariate Models with Categorical Variables 


The obvious next step in intergenerational stratification research is to combine 
the virtues of the second and third generation of research, estimating relative- 
ly complex multivariate models and at the same time adequately treating the 
discreteness and non-uniformity of the core variables of social stratification. It 
is obvious that a full categorical treatment of all variables is still beyond the 
state of the art; at present, the best we can do is to estimate mixed models that 
treat some of the variables as continuous (Winship & Mare 1983). One way to 
do this is to reformulate loglinear models as logit models with continuous (as 
well as categorical) covariates (Logan 1983). This will reintegrate the inter- 
vening variables from the Blau-Duncan model, in particular educational 
attainment, with intergenerational occupational mobility models. 

One variety of these models, ordered logit models, has gained prominence 
as a way of carrying out cohort analyses of a part of the status attainment 
model—educational attainment. Mare (1981) applied ordered logit models to 
transitions between subsequent levels of educational attainment in order to 
separate marginal effects (educational growth over cohorts) from the in- 
fluence of parental background on the probability of making each transition. 
He shows for the United States that the influence of parental status on 
educational outcomes is substantially weaker for higher than for the lower 
transitions. This observation has been substantiated in a number of subsequent 
studies: Smith & Cheung (1986) on the Philippines, Shavit & Kraus (1990) on 
Israel, and for a dozen additional countries in a series of papers presented at 
the 1990 Madrid World Congress of Sociology (to be published in Shavit & 
Blossfeld 1991). Given this pattern of effects and the ubiquitous growth of 
educational attainment over cohorts in virtually all countries, it follows that 
the metric regression of educational attainment on parental background will 
decrease over cohorts (assuming that the compositional effects are not com- 
pensated by historically increasing inequality of educational opportunities at 
the higher level transitions). This provides one possible explanation for the 
cross-temporal increase in relative mobility that has been observed in in- 
tergenerational occupational mobility studies. Moreover, in another multi- 
variate study on the United States, Hout (1988) shows that the influence of 
father's occupation on respondent's occupation is larger for the less educated, 
thus providing another hypothesis as to how educational growth can promote 
increased societal openness. 


Women and Families 


Given the strong concentration on occupational status, stratification research- 
ers have found it difficult to deal with women (Acker 1973). To begin with, 
for the respondent's generation, many married women are outside the labor 
force. This is even more true for women in the parental generation (mothers). 
An additional impediment is that the occupational distribution of women is so 
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different from that for men that applying existing status or class categories to 
women is problematic (Bielby & Baron 1986). Faced with these problems, 
many of the major data collection efforts in the first two generations simply 
excluded women from the sample altogethe-. Most of our comparative knowl- 
edge on the intergenerational mobility of women therefore stems from other 
sources than the major social mobility surveys (Roos 1985). Roos shows that 
throughout the world.the process of educational and occupational status 
attainment is similar for men and women, except that the direct effect of 
father's occupation on occupational status is weaker for women. However, 
women’s occupational class position is quite different from that of men and is 
somewhat less associated with father's class position than is true of men, 
particularly with respect to class immobility (Portocarero 1983a,b). As com- 
pared to their fathers, women are on average more upwardly mobile than are 
men (i.e. they enjoy higher social status tkan do men from similar origins), 
but this conclusion is likely to be continzent upon the exclusion of non- 
employed women and upon the (male based) status measure that is used 
(Blishen & Carroll 1978, Boyd 1982). 

One of the traditional arguments for the exclusion of women from social 
mobility research has been that the unit of stratification is the family, and not 
the individual, in conjunction with the arzument that the husband's status 
dominates the family's life chances. This letter assumption has become more 
and more questionable, if it was ever applicable. In recent years, the relation 
between the effect of husband's and wife's status on the family's social 
characteristics has stirred a hot debate in Britain (Goldthorpe 1983, 1984, 
Erikson 1984, Heath & Britten 1984, Stanvorth 1984, Goldthorpe & Payne 
1986). 

Although it is a commonplace that women should be included in all future 
stratification research, some additional rerrarks can be made with respect to 
why and how women’s statuses should be considered. An unresolved issue 
here is the measurement of women’s occupstional status. The fact that women 
are concentrated in a smaller number of occupations than are men should be 
incorporated in both measurement and stractural models. Apart from this, 
future research should take the issue of the family as the unit of stratification 
not as a debate about definitions but as an empirical problem. This requires 
developing models of how the status characteristics (and social origins) of 
each member combine to produce status ouscomes meaured at the level of the 
family rather than at the level of the individual (Haller 1981). This agenda 
provides a new role for the old problem of homogamy (Ultee & Luijkx 1990). 
Paradoxically, such questions gain importarce as more women enter the labor 
market and the traditional nuclear family is in decline, since in such circum- 
stances the stratification of individuals and the stratification of families is 
truly different. 
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Consequences of Social Mobility 


As noted above, questions about the consequences of social mobility received 
less attention in the second generation than in the first (but see Hodge & 
Treiman 1966 on racial prejudice and Hodge et al 1986 on income) and were 
essentially ignored by the researchers of the third generation. Nevertheless, 
there has been an important technical advance, Sobel’s (1981) "diagonal 
mobility" models, which provide a means of assessing the relative importance 
of two-identically categorized variables (e.g. father's and son's occupation) 
on a dependent variable, as well as an estimate of the effect of any combina- 
tion of categories. By this means it is possible to assess whether mobility 
per se has consequences above and beyond the additive effects of origins 
and destinations, as claimed in a number of early theoretical arguments 
(e.g. Janowitz 1958, Lipset 1960). Sobel (1985) applied this model to fer- 
tility. Another recent application is to voting behavior (De Graaf & Ultee 
1990). Cross-national comparisons of mobility effects are an obvious next 
step. 


CONCLUSIONS 


The developments in comparative social mobility research over the past 40 
years can be summarized as follows: 

With respect to data collection, much progress has been made. Data 
available for comparative analysis has gone from small numbers of highly 
aggregated published bivariate tables based on nonrepresentative or unknown 
samples from a few Western countries to unit record data containing many 
variables and highly detailed occupational and educational classifications 
compiled from large high quality sample surveys conducted in many countries 
throughout the world, often with several surveys available for a given coun- 
try. To date, however, many of the available data sets have not yet been 
exploited for comparative mobility or status attainment research. 

With respect to measurement much has been achieved as well. In particu- 
lar, the measurement of occupational position is well on the way to standard- 
ization across countries. Increasingly, data sets are coded (or recoded) into the 
detailed categories of the International Standard Classification of Occupations 
(ILO 1969). With the ISCO as a framework, several standardized measure- 
ment schemes have been developed. To a lesser extent, the problem of the 
comparative measurement of educational attainment has also been solved. 

With respect to techniques of data analysis, the field has moved from 
comparisons of inflow and outflow percentages, first to regression techniques 
and then to loglinear models. This is a considerable advance with respect to 
the treatment of bivariate relationships. It is now possible to decompose 
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intergenerational occupational mobility tables into meaningful components 
and to model association patterns independent of marginal distributions. Seen 
from a different angle, however, there has been substantial retrogression from 
the second to the third generation since analyses to date based on loglinear 
procedures have not been able adequately tc cope with multivariate problems, 
whereas the models of the second generat.on could do so easily. 

With respect to substantive issues, that is, problem development, consider- 
able constancy appears across the three genzrations. If there is a trend in this 
respect, it is an unfortunate one: the array of questions addressed in the first 
generation was much wider than in the second generation, and narrowed 
down still further in the third generation. Taere is also substantial continuity 
across generations in the theories investigated: hypotheses about economic 
development (modernization) compete wiih hypotheses about institutional 
factors. In our judgment, however, the modernization theory of social mobil- 
ity has been more substantially developed than the various ideas about 
institutional influences on social mobility patterns, which are at present 
underdeveloped. 

Finally, with respect to results, we find that some important insights have 
been secured (although far too few). In cur opinion, three generations of 
research have led to the following firmly established conclusions about the 
general pattern of intergenerational stratification: 


a. Throughout the world, intergenerational occupational mobility is driven by 
socioeconomic status as measured by sczles of the Duncan SEI-type; there 
is debate about the existence and nature of other dimensions, but it is clear 
that these are secondary to the effect o? socioeconomic status. 

b. Throughout the world, there is intergenerational occupational immobility 
in excess of what would be expected oa the basis of the distribution of 
socioeconomic status. Most of it occurs in classes with significant pro- 
prietorship (farms, businesses, and processional practices) and, in part, 
can be attributed to the direct transfer of ownership. The excessive amount 
of occupational inheritance constitutes the main discreteness in the process 
of stratification. 

c. Intergenerational occupational mobility patterns differ across time and 
countries. In most countries, there has 5een a slow but systematic trend 
toward increasing relative mobility in the years since the Second World 
War. Some countries have shown persistently bigher relative mobility 
rates than others (in particular the United States and Sweden). However, 
there is as yet no conclusive evidence regarding the contextual factors that 
determine these changes and differences. 

d. Research on status attainment suggests strongly that education is a more 
important determinant of occupational status than is parental occupation 
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and that the bulk of the effect of education is independent of social origins. 
At the same time education is an important mechanism for the transfer of 
advantage from generation to generation. 


It is evident that firmly established conclusions regarding societal variations 
and similarities in the structure and process of intergenerational social mobil- 
ity are not numerous. There is much work to be done. i 
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Abstract 


This paper reviews the literature on the neglected role of women in migration. 
It argues that focusing on gender and the family can provide the necessary 
linkage of micro and macro levels of analyses. Striving to contribute to a 
gendered understanding of the social process of migration, the review orga- 
nizes the literature along these major issues: How is gender related to the 
decision to migrate—i.e. what are the causes and consequences of female or 
male-dominated flows of migration? What are the patterns of labor market 
incorporation of women immigrants—i.e. what accounts for their participa- 
tion in the labor force and their occupational concentration? What is the - 
relationship of the public and the private—i.e. what is the impact of work 
roles on family roles and of the experience of migration on the immigrants 
themselves? Throughout, the necessity to understand how ethnicity, class, 
and gender interact in the process of migration and settlement is stressed. 


INTRODUCTION 


Despite the overwhelming presence of women in migration flows, until 
recently the role of women in migration had been totally neglected. As 
Houstoun et al (1984:908) have pointed out, the pervasive assumption that the 
international migrant is a young, economically motivated male has over- 
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shadowed the reality of migration streams. that were dominated by women: 
“Tronically, few immigration researchers (end even fewer policy makers) are 
aware that legal immigration to the United 3tates—still very much the largest 
of all international flows—has been domirated by females for the last half- 
century." For the United States a crossover in sex differentials in migration 
occurred in 1930, after which women anncally outnumbered men (Houstoun 
et al 1984: Table 1). As demographers, Hotstoun et al highlighted this glaring 
neglect as it appears in the discipline of seciology. For the field of history, 
Maxine Seller (1975:197) emphasized it when she pointed out that while 
much had been written about the achievements of the men who came to the 
United States from Southern and Eastern Europe at the turn of the century, 
much less appeared about the experiences of the immigrant women of this 
period. She attributed this neglect to the persistence of negative stereotypes 
which made it appear that women did little worth writing about. After these 
various calls, more attention was paid to women and migration (cf Simon & 
Brettell 1986). Now, the topic has mushroomed. Yet, while we know a great 
deal about the impact of women's position on other social outcomes such as 
fertility (cf Mason 1987), we have yet to dezelop a truly gendered understand- 
ing of the causes, processes, and consequerces of migration. Paying attention 
to the relationship between women's social position and migration will help 
fill the void regarding our knowledge of wcmen as immigrants and contribute 
to a greater understanding of the lives of women. It will also elucidate those 
aspects of the process of migration that were neglected by the exclusive focus 
on men. 

The study of immigration is by its very nature interdisciplinary. A natural 
division of labor has arisen whereby sociolegists attend most to contemporary 
immigration flows (the Latin American and Asian), historians are concerned 
with past flows (the Southern and Eastern European), and anthropologists 
relate to the impact of emigration and return on the sending communities in 
underdeveloped nations. Without seeking to be exhaustive, this review draws 
upon research from these various disciplmes and attempts to organize the 
literature to show how ethnicity, class, and gender interact in the process of 
migration and settlement. l 

As we try to bring women into the study of migration, it is worth consider- 
ing Judith Stacey & Barrie Thorne's (1€85) assessment of how different 
disciplines have incorporated gender. They stress that anthropology, history, 
and literature have all been more profouadly transformed than sociology, 

-while psychology, economics, and political science have all resisted 
transformation. Bringing women into the humanities and the social sciences 
takes place in stages: first, by filling in the gaps in knowledge resulting from 
their absence; second, by transforming the conceptual and theoretical 
frameworks of their disciplines (see also Louise Tilly 1989). 
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Traditionally, history chronicled the world of men in public places—in 
work, organizations, church, politics. In this history, women were included 
only when they left home and entered the labor force, took part in strikes, 
joined labor unions, or worked for suffrage, since only then did their activities 
become public and accessible to traditional research methodology (cf Wein- 
berg, in press). As Stacey & Thorne emphasized, feminist historians shifted 
the focus of research by placing women at.the center and writing the history of 
the private sphere and of the relationship between the public and the private. 
Likewise, literature was transformed by the inclusion of women and ethnic 
writers, changing the canon that defined the field. Anthropology has been the 
most deeply transformed by feminist contributions because the analysis of 
kinship and the sexual division of labor has always been its very core (cf 
Yanagisako & Collier 1987). Thus, feminist thinking matured and went from 
woman-centered analyses to providing what Stacey & Thorne (1985:305-6) 
call a “gendered” understanding of all aspects of human culture—one that 
traces “the significance of gender organization and relations in all institutions 
and in shaping men's as well as women's lives." 

Within sociology, by contrast, much research on women has filled in gaps 
and yielded new insights and directions, but the field itself has undergone 
little transformation, as a truly gendered understanding of most topics has not 
been achieved. Traditionally, sociology was neither totally male-defined, as 
history or literature, nor basically gender-sensitive, as anthropology. Thus, 
Stacey & Thorne judged that the feminist contributions to sociology were 
contained by the delimiting capacity of functionalism to explain male-female 
differences; of empiricism to treat gender as a variable, rather than as a central 
theoretical concept; of Marxist sociology to ghettoize it; and by the un- 
derdevelopment of feminist theory itself. 

Striving to contribute to a gendered understanding of the social process of 
migration, I have organized the various strands of this literature according to 
the following major issues: How is gender related to the decision to migrate— 
i.e. what are the causes and consequences of female or male-dominated flows 
of migration? What are the patterns of labor market incorporation of women 
immigrants—i.e. what accounts for their participation in the labor force and 
their occupational concentration? What is the the relationship of the public 
and the private—i.e. what is the impact of work roles on family roles and of 
the experience of migration on the immigrants themselves? 


THE DECISION TO MIGRATE 
Micro and Macro Linkages 


The underlying assumption in studies of migration has been the male pau- 
per—a single or married male who looks forward to amassing capital with 
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which to return to his native country. Thus, the corollary assumption has been 
that it is males who typically make the decision to migrate and that females 
follow. As Everett Lee (1966:51) most fully expressed it in his seminal 
“push” and “pull” theory of migration, "Indeed not all persons who migrate 
reach that decision themselves. Children are carried along by their parents, 
willy-nilly, and wives accompany their husbands though it tears them away 
from environments they love." Houstoun et al (1984:919) also stressed that, 
with the exception of domestics,, women “generally migrate to create or 
reunite a family;" they saw female-dominated flows of migration as secondary 
movements generated by the original migration of economically motivated 
young males. While women do migrate to join men and to create families (cf 
Watts 1983, Tyree & Donato 1986), this hardly constitutes an explanation for 
the fact that "during the last half-century, the traditional working-age immi- 
grant male has accounted for only a third of all immigration to the United 
States" (Houstoun et al 1984:913). Katharine Donato & Andrea Tyree (1986) 
analyzed the sex ratios of immigrants from over a hundred nations and 
examined the impact of American immigration law as well as the characteris- 
tics of migrants and their countries of origin. They concluded that, when 
viewed on the whole, family reunification accounted for the sex distribution 
of US immigration. In addition, the availability of jobs in the expanding 
health care industry also played a part. When viewed in the detail of national 
origins, however, a more complicated picture emerged with factors such as 
the presence of a US military base and the country's socioeconomic con- 
ditions also playing a role. 

Contrary to the manner in which the topic of women is usually treated in 
sociology, a truly gendered understanding of the social process of migration is 
provided by Sherri Grasmuck & Patricia Pessar's Between Two Islands (in 
press). Their analysis of contemporary migration from the Dominican 
Republic to New York City (the two islands) entailed and benefitted from the 
collaboration of a sociologist and an anthropologist, respectively. By focusing 
on the household and the relations between its members as these affect the 
decision to migrate, Grasmuck & Pessar's analysis also provided a much- 
needed link between the traditional micro explanations of migration and the 
recent macro explanations of it (cf Pedraza-Bailey 1990, Boyd 1989). 

In sociology, the traditional, individual micro approach was best developed 
by Lee's (1966:50) theory which focused on the individual migrant's decision 
to migrate—the "push" and "pull" factors that "hold and attract or repel 
people," as well as the intervening obstacles (distance, physical barriers, 
immigration laws, cost), the influence of personal traits (stage in the life 
cycle, contact with earlier migrants), and the effect of transitions (marriage or 
retirement). 

The more recent approach to the study of immigration focused on struc- 
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tural-level variables. The link between migration and world patterns of un- 
equal development increasingly became evident. North America remained the 
magnet that yesterday as well as today attracts the world's poor. In Western 
- Europe the periphery countries of Spain, Italy, Greece, and Turkey became 
suppliers of labor to the industrialized core countries of France, Germany, and 
Switzerland. Hence, a new set of structural, macro perspectives emerged. The 
structural perspective argued that a system of economic migration had de- 
veloped from the flow of labor between developed and underdeveloped 
nations due to the functions that this system of labor migration performed. 
Arguing independently but in a similar vein, Manuel Castells (1975), Michael 
Burawoy (1976), and Alejandro Portes (1978) agreed that immigrant labor 
had structural causes and performed important functions for the developed 
capitalist nation that received it. While replacement labor—the migration of 
low-skilled labor—provided countries such as the United States or France 
with a dependable source of cheap labor, it also provided countries such as 
Mexico or Turkey with a "safety valve" as emigration became the solution to 
their inability to satisfy the needs of their poor and lower-middle classes. 
Silvia Pedraza-Bailey (1985) sought to extend the structuralist approach to 
explain refugee flows as involving political functions that can also generate a 
system of political migration between sending and receiving societies, such as 
Cuba and the United States. Maria Patricia Fernandez-Kelly (1983) demon- 
strated yet another linkage that migration provides between developed and 
underdeveloped capitalist nations with her in-depth study of the maquiladoras 
in the US-Mexican border region. The maquiladoras are manufacturing 
industries that resulted from the Mexican border industrialization program, 
whose expressed intent was to provide employment opportunities for Mex- 
icans to help curtail undocumented migration. Fernandez-Kelly (1983:209) 
also emphasized that the most striking feature of this development was “its 
gender-specific nature. Although the majority of undocumented aliens work- 
ing the fields of the U.S. Southwest continue to be male, 85 percent of those 
working in the export-manufacturing plants along the Mexican border are 
female." Saskia Sassen-Koob (1984) sought to extend the structuralist 
approach to incorporate the migration of women by noting the intrinsic 
relationship between the recruitment of women into the new manufacturing 
and service jobs generated by export-led manufacturing in several Caribbean 
and Asian countries and the employment of immigrant women in highly 
industrialized countries, particularly in major cities which have undergone the 
shift to a service economy. 

The recent macro approach was an important corrective to the traditional, 
micro approach which failed to take into account that since the advent of the 
Industrial Revolution all individual decisions to move have resulted in migra- 
tion flows that moved in only one direction. The danger of the structural 
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emphasis, however, lies in its tendency to obliterate people, to lose sight of 
the individual migrants who do make decisions. The theoretical and empirical 
challenge now facing immigration research inheres in its capacity to capture 
both individuals as agents, and social structure as delimiting and enabling. 
We need to consider the plight of individuals, their propensity to move, and 
the nature of the decisions they make. We also need to consider the larger 
social structures within which that individual plight exists and those decisions 
are made. Such a link between micro and macro levels of analysis is provided 
by Massey et al (1987) in their analysis of Mexican migration to the United 
States. They showed that international migration originates historically in 
transformations of social and economic structures in sending and receiving 
societies, but that once begun migrants' social networks grow and develop. 
These networks support and channel migration on a continuously widening 
scale. Thus, the migration that was initially propelled by an external, structur- 
al dynamic and logic increasingly acquires an internal dynamic and logic of its 
own. In this way, migration comes to fuel itself. As families make migration 
part of their survival strategies and use it during stages of the life cycle when 
dependence is greatest, individual motivations, household strategies, and 
community structures are altered by migration, making further migration 
more likely. 

As Boyd pointed out, however, to date much of the recent research on 
networks has been indifferent to gender, to the fact that "the division of 
labor—the structured activities in a society—are gendered" (1989:656). Gras- 
muck & Pessar's analysis (in press) also focused on social networks and 
households as the link between micro and macro levels of analysis; they 
demonstrated that gender is central to household decision-making. Therefore, 
it is central to the decision to migrate, as a family strategy intended to meet 
the challenges that accompany underdevelopment and economic and political 
transformation in the Third World. As Grasmuck & Pessar emphasized, the 
household is the social unit that makes decisions as to whether migration will 
take place, who in the family will migrate, what resources will be allocated to 
the migration, what remittances or household members can be expected to 
return, and whether the migration will be temporary or permanent. All of 
these decisions are guided by normatively prescribed kinship and gender roles 
as well as by the hierarchy of power within the household (cf Hondagneu- 
Sotelo, in press). 

Both the studies of Massey et al and of Grasmuck & Pessar have the further 
virtue of a solid interdisciplinary research design. They merge the methodolo- 
gy of sociology (a survey) with that of anthropology (ethnography) in their 
research, thereby creating an ethnosurvey that overcomes the shortcomings of 
each (cf Massey et al 1987:11-13). In addition, within the country from 
which the migration originated (Mexico, Dominican Republic), they chose to 
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study, through both methods, very different types of communities: a tradition- 
al rural town, a commercialized agricultural community, an industrial town, 
and a city. This diversity best allowed an assessment of the social, economic, 
and cultural forces that contributed to the migration through comparisons of 
migrant with nonmigrant households. In the area of destination (California, 
New York City), drawing a representative sample of all immigrants was not 
possible, but snowball sampling led to the inclusion of both documented and 
undocumented immigrants. It is worth highlighting the methodology of both 
these studies because, unfortunately, there is a real methodological problem 
in this area of research. A number of very interesting studies of women 
immigrants collected in-depth interviews but from too small or unrepresenta- 
tive a sample, making their insightful conclusions less than precise and firm. 

Focusing on the household as the unit of analysis, Grasmuck & Pessar 
analyzed the household political economy as well as the power relations 
among its members that resulted in migration as a household strategy issuing 
from structural necessity. In the Dominican Republic, the transition from 
subsistence farming (based mostly on household labor) to commercial farm- 
ing (based on wage labor) disrupted the traditional family relations. Whereas 
earlier many sons had been an asset, they were now an obstacle, a problem 
that was solved by sponsoring a son's emigration. This change in the political 
economy of the area also held profound consequences for the women, since 
the more successful households with status aspirations were able to gain in 
status by freeing their women from agricultural work. However, the women 
then ceased to contribute economically to the family and also became totally 
dependent on their husbands. One way to solve the contradiction between 
local gender and class norms, on the one hand, and the goal of the migration 
strategy, on the other, was to sponsor the wife's emigration to the United 
States, where there was less resistance to women working outside the house- 
hold for wages. For the women themselves, the act of emigrating also became 
a way of escaping total dependence on their husbands. Thus, both sons and 
wives undertook the decision to migrate because of the gains in personal 
autonomy they anticipated, reducing the patriarchal control of fathers and 
husbands. In her participant observation study of Mexican undocumented 
women, Pierrette Hondagneu-Sotelo (in press) also found that while the 
decision to migrate may constitute a joint family strategy, the actual process 
of decision-making and the staggered departures of family members betray 
enormous interpersonal conflict. 

As Pessar (1986:276) pointed out, "In contrast to men, migration does not 
rupture the social sphere in which women are self-actualized," that of the 
household and family, Hence, not only was gender at the center of the 
decision to emigrate it was also at the center of the reluctance to return, as 
women struggled to maintain the gains that migration and employment had 
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brought them. Grasmuck & Pessar found that while men were eager to return, 
a desire expressed in their frugal, auste-e living to accumulate savings, 
women tended to postpone or avoid retum because they realized it would 
entail their retirement from work and the boss of their new-found freedoms. 
As a result, a struggle developed over finances and return that revolved 
around the traditional definitions of gender roles and privileges which the 
migration itself had challenged and which many men sought to regain by 
returning home. Many women spent large amounts of money on expensive, 
durable goods, such as a home and home furnishings, serving to root the 
family securely in the United States and :o deplete the funds necessary to 
relocate. Through their use of interdisciplimary methods and understandings, 
Grasmuck & Pessar reached a depth of ana.ysis each method alone could not 
have sounded and produced a truly gendered understanding of the social 
process of migration. 


The Demographic Composition of the Migration Flow 


Emigration is a process experienced differeatly by women and men; hence, it 
can be sex-selective. As Joy Parr (1987:539) underlined, “emigration can be 
the product of sex imbalances; it also forms them, both in the old country and 
the new.” Examining international migration patterns, we can see that not 
only have women comprised at least half o£ recent immigrants to the United 
States, they have also predominated amoag immigrants to Argentina and 
Israel and now constitute an increasing skare of migrants in areas such as 
West Africa and the Persian Gulf states (Tvree & Donato 1986). Examining 
internal migration patterns, we can see that whereas in Africa men predomi- 
nate in migration to the cities and women remain in rural areas to farm the 
land, in Latin America, the Caribbean, amd Philippines, most migrants to 
cities are women. In South Asia men outnuriber women substantially in cities 
(see Fernandez-Kelly 1983, Reiff et al 1983, Gabaccia 1987, Khoo et al 
1984, Lee 1989, Hojman 1989, Gugler 1989). These contrasts are largely due 
to the nature of the existing land tenure and agricultural production arrange- 
ments. Moreover, profound change can take place over time, as in sub- 
Saharan Africa where, in Joseph Gugler's 11989) analysis "Women stay on 
the farm no more." This change resulted from the increased opportunities in 
cities—in income and employment for women together with medical care and 
education for children—at the same time tlzat agriculture became less profit- 
able as a result of state pricing policy for cesh crops. Moreover, government 
policy can create imbalanced migration flows by legally restricting the migra- 
tion of males or females, as in Lesotho Wilkinson (1983) showed the 
consequences for the women of Lesotho from neighboring South Africa's 
influx control laws that excluded women. As a result, a male-dominated flow 
of migrant labor to work in the mines crossed the international boundary of 
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Lesotho and also generated a female-dominated internal flow of migration to 
the cities of Lesotho where phenomenal sex imbalances occurred. 


MALE EMIGRATION Flows of migration that are dominated by men require 
that we consider “the woman’s side” when the women themselves are left 
behind in the communities. Caroline Brettell (1988) analyzed the impact of 
longstanding male emigration from Portugal to Brazil and Spain in the late 
nineteenth and twentieth centuries. She used the household data kept by the 
parish priest of Santa Eulalia de Lanheses in Portugal to assess the impact of 
the emigration on the matricentric characteristics of the way of life in this 
area, particularly the formation and structure of households. Households 
adjusted to the centuries of absence of men by becoming female-headed, often 
extended three-generation households, and, contrary to the Mediterranean 
pattern, patriuxorilocal, since grooms often moved in with the wife’s family. 
Male migration also affected the lives of women by promoting a delayed age 
at marriage and high rates of spinsterhood as well as illegitimacy, both of 
which bound the women more firmly to their families of origin. 

Barbara Engel (1986) also attended to “the woman’s side” as she analyzed 
the impact, on the village, the family, and the women. left behind, of the 
emigration of men from the villages of Kostroma province to the cities of 
Russia in the late nineteenth century. As peasants, the family was an eco- 
nomic unit of production, not just of consumption, and marriage to a village 
girl tied the young man who left more firmly to the family-economy as the 
wife continued to work in agriculture, in his place, and to live with her 
in-laws. Families thus placed greater constraints on women's migration than 
they did on men's; hence, few women emigrated. As a family strategy the 
out-migration of men that retained wives laboring in the village had many 
advantages. Marrying a man who worked elsewhere made women's lives 
more difficult in some ways as it tied them tighter to the soil and increased 
their burdens, but it also made women's lives easier in other ways as they 
endured childbirth less often and their children survived more often due to 
better health and nutrition. Moreover, the marriage relation itself may well 
have benefited as women acquired more control over their lives and a 
measure of dignity while the marriage became more of a partnership. 


FEMALE EMIGRATION Flows of migration that are dominated by women are 
often neglected, yet it is quite common for refugee flows to be initially 
dominated by women and children, as in the early years of both the Cuban and 
Indochinese exodus to the United States—an imbalance that altered over time 
(Pedraza-Bailey 1985, Rumbaut 1989). Refugees initially lack a motivation to 
settle elsewhere and only when they come to perceive the existing political 
situation as intolerable do they feel forced to leave, a choice they would rather 
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not have made (cf Stein 1981:322, Rose 1981:8). Thus, refugees often suffer 
from the mentality of those caught in a sinking ship—to reach safety, women 
and children first. Secondary migrations of men, who move to reunite fami- 
lies, are then generated. 

Ultimately, the demographic composition of migration flows is important 
not only because its causes are various but also because of its consequences. 
In his comparative analysis of Italian and Jewish mobility in New York at the 
turn of the century, Thomas Kessner (1977) underscored that their patterns of 
social mobility and attainment depended on the varying composition of the 
migration flows. Newcomers that arrive as temporary migrants—as "birds of 
passage," in Michael Piore's (1979) phrase—work with the goal to return 
home, tolerating the most abysmal working conditions to accumulate capital 
for their investments back home. By contrast, permanent immigrants must 
make their future in the new land and cannot tolerate abysmal working 
conditions by thinking they are temporary. Thus, they seek to attain social 
mobility in the new society, taking greater risks and making more long-term 
investments, such as setting up family businesses (Piore 1979:55—68). The 
two types of migration are reflected in the demographic composition of the 
flows. Flows of temporary migrants, such as the Italian, are by and large 
nonfamily movements of males in the productive years who intend to make 
money and return home. By contrast, flows of permanent immigrants, such as 
the Jewish, are characterized by the migration of families who intend to 
remake their lives and homes (Kessner 1977). 

Focusing on both the causes and consequences of a female-dominated flow 
of migration, Hsia Diner (1983) studied Irish immigrant women in the 
nineteenth century. The Irish migration was pushed by conditions that pre- 
vailed throughout much of Europe then—poverty, landlessness, and the social 
and economic dislocations that accompanied the transition from an agrarian 
feudal society to an industrial, capitalist society (cf Bodnar 1985). These 
conditions were exacerbated by the famine of the late 1840s. Coupled with the 
Irish system of single inheritance and single dowry, Ireland increasingly 
became the home of the unmarried and the late married. More than half of the 
Irish immigrants to the United States were women, and as the century wore on 
the migration became basically a female mass movement. As Diner (1983:4) 
demonstrated, the root cause was that social and economic conditions in 
Ireland were such that "Ireland became a country that held out fewer and 
fewer attractions to women." Women had few realistic chances for marriage 
or employment; to attain either most had to turn their backs on the land of 
their birth. Hence, not just famine and poverty but what Jackson (1984:1007— 
8) called "the interlocking relationship of land-family-marriage" caused the 
preponderance of women in the migration. As a consequence of land scarcity, 
both arranged marriages and the practice of dowries spread, and celibacy and 
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late marriages rose. One escape from family and spinsterhood was for women 
to join a religious order; another was emigration. 

Consequently, the Irish exodus to the United States was predominantly 
female and young, a migration stream mostly composed of single persons. 
The usual kin chain migration became a female migratory chain in which 
women brought over other women—sisters, mothers, nieces, aunts, friends. 
As will be seen later, the major consequence of the predominantly female and 
single nature of the migration was that Irish women were able over- 
whelmingly to enter domestic service. 


THE INCORPORATION OF WOMEN 


Labor Force Participation 


That immigration has a decided impact on the labor force participation of 
women is a central fact of immigration research. It is also one of the major 
issues in studies of Cuban immigrants (Perez 1988). In contrast to the very 
low rates of labor force participation of women in Cuba prior to the revolu- 
tion, and of Mexican and Puerto Rican women in the United States at present, 
Cuban women who emigrated to the United States have had a very high rate of 
labor force participation (Sullivan 1984). Yolanda Prieto (1987) interviewed a 
sample of Cuban-born women in New Jersey, a more working-class commu- 
nity than Miami, and concluded that the major determinant of the massive 
entrance of the women into the labor force was their original social class (cf 
Garcia-Castro 1986). These women were middle-class either in their origin. 
or, if working class, in their aspirations. Achieving the upward mobility of the 
Cuban family in the United States made women's work necessary and broke 
with the traditional Cuban notion that a woman's place is in the home, 
justifying the massive entrance of women into the labor force. 

Indeed, Cuban women overwhelmingly saw work as the opportunity to 
help the family, rather than as an opportunity for self-actualization. Thus, 
Myra Max Ferree (1979:48) wrote that Cuban women were an example of 
employment without liberation. Cubans had apparently stretched the tradi- 
tional view of women existing for the family to include employment as part of 
that role, while implying no necessary change in values. Ferree emphasized 
that the seeming ease with which this occurred ought to cast doubt on cultural 
explanations for the low labor force participation of Hispanic women (cf 
Santana-Cooney & Ortiz 1983, Reimers 1985). However, Lisandro Perez 
(1988) underscored the need to look at generational differences. The first 
generation reared in the traditional culture may well view employment as 
instrumental, but the second generation, more American, might hold a differ- 
ent set of attitudes. Perez's (1986) research also showed that whereas Cubans 
have a relatively high family income, they do not compare so favorably in 
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terms of individual income, pointing to the importance of the dual-income 
family in the formation of what has come to be known as the Cuban “success 
story." 

Numerous research studies have also examined the labor market outcomes 
of immigrant women—the occupations and income they attained and the 
disadvantages reflected in different "payoffs" to their characteristics (e.g. 
Tienda et al 1984, Sullivan 1984, Boyd 1984, Evans 1984, Lichter 1983). 
Most of these studies rely on large data sets (such as the censuses or the 
National Longitudinal Survey), and quantitative analyses, rendering their 
results persuasive. But, by and large, they suffer from the problem Stacey & 
Thorne identified as the way sociology has incorporated women: they treat 
gender as a variable, rather than as a central theoretical principle. Moreover, 
many of these studies tend to make gross comparisons between all immigrants 
vs all native-born (e.g. Tienda et al 1984, Maxwell 1988, Morrison & Lichter 
1988) that fail to take into account the substantial variation in the causes and 
consequences of the incorporation of different immigrant groups (e.g. Cubans 
vs Mexicans) and of the native-born (e.g. whites vs blacks). 


Occupational Concentration 


The concentration of certain immigrant or ethnic groups in particular types of 
occupations is a central fact that a theory of the incorporation of racial or 
ethnic groups needs to explain (cf Feagin 1978). Like men, immigrant women 
became occupationally concentrated but along a much smaller spectrum of 
choices. Although immigrant women can be found doing the hard labor of 
construction in some societies (see Lee 1989), most of them cluster in just a 
few occupations. They become domestic servants, work for the garment 
industry, donate their labor to family enterprises, or most recently, work in 
highly skilled service occupations, such as nursing. 


DOMESTIC SERVICE As Mary Garcia-Castro (1986) emphasized, migration 
has different meanings for married women with or without children than for 
single women. Diner specified that the major consequence of the pre- 
dominantly female and single nature of the Irish migration was that Irish 
women overwhelmingly entered domestic service, an occupation in which 
there was a “labor vacuum” because others did not want it. Native, Anglo- 
Saxon Protestant women considered that type of work demeaning. Other 
immigrant or poor women, most of whom were married, found that the 
expectation that they live in interfered with family life. While the work, 
indeed, lacked authority, Diner stressed the advantages it held for poor 
women. The work environment was healthier and safer than that of a factory; 
the job allowed women to be employed, unlike the men, at times of severe 
economic recession; the environment exposed the women faster to middle- 
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class standards and lifestyles, promoting their Americanization; and, above 
all, it allowed women to amass savings at an impressive rate. Those savings 
were used to bring other female relatives over; to send remittances back home 
that would help the family pay off their farm mortgages; to support the 
Catholic church and their favorite devotions; to secure the "nest egg" for an 
American marriage; and to finance their upward mobility by providing the 
foundation for a small business or an education that would lead them or their 
daughters into teaching, nursing, or stenography. Thus, Irish women ex- 
perienced higher rates of social mobility than Irish men. As Donna Gabaccia 
(1989) noted in her recent review essay, Diner's in-depth study of the women 
of a particular immigrant group is the sort of research needed before going on 
to attempt synthetic comparisons of various groups of immigrant women over 
long spans of time (e.g. Weatherford 1986) that result in overly general 
statements about all who are “foreign and female." However, case studies that 
remain inconclusive regarding the uniqueness or similarity of that group to 
others (e.g. Lindstrom-Best 1988) are equally problematic. More in-depth 
case studies and controlled comparisons of a couple of groups are still needed 
to lay the foundations for future syntheses. 

Numerous other studies focus on immigrant women who overwhelmingly 
entered domestic service. Evelyn Nakano Glenn's (1986) study of three 
generations of Japanese Women in domestic service, Issei, Nisei, War Bride, 
depicts domestic service not as the occupational incorporation of single 
women resulting from both choice and circumstance, but as one of the few 
occupations open to women of color in American history. Glenn emphasized 
that the.labor market allocates certain types of jobs to certain types of 
people—by gender, color, and class—reserving unskilled, unprotected, poor- 
ly paid jobs for women and people of color. She also examined the changes 
that did take place over three generations of Japanese women. The first 
generation immigrated between 1915 and 1924, one of the peak moments in 
the history of American racism, so they had few options other than agricultur- 
al work in the fields or domestic service. The second generation, their 
children, benefited from a more open society but suffered the vicissitudes and 
dislocations the internment produced. For some older nisei, the post-war 
period was one of frustration. Their horizons had expanded with the employ- 
ment in the internment camp, only to see them shrink again as they were 
forced subsequently to return to domestic service. The war brides were a more 
socially heterogeneous cohort and one who, in comparison, suffered less. 


GARMENT INDUSTRY Among yesterday's immigrants, as well as today's, 
women became concentrated in the garment industry. This industry relied on 
a traditional skill that throughout much of the world defined womanhood; 
and, moreover, it relied on homework and subcontracting, allowing women to 
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stay at home with their children. This advantage led women to accept low 
wages and exploitative conditions (Howe 1976). In his study of New York's 
garment industry, Roger Waldinger (1986:50) pointed out that, starting in the 
late nineteenth century, a market for ready-made mass-produced women's 
clothing was created by urbanization, the development of a national market, 
and increasing population. New York became the leading center of the 
garment industry, its growth spurred by the arrival of massive waves of 
immigrants, in particular Russian Jews and Italians. The Jews and Italians 
flowed into the city just at the time when the demand for ready-to-wear began 
to surge. Since many of the Russian Jews had previously developed skills in 
the needle trades, garments quickly became the Jewish trade. As Waldinger 
noted (1986:51), "among the Jews, the garment shop was an employer of both 
men and women. Men went into coats and suits, the staple items of the 
garment business up to 1910. Shirtwaists, undergarments,. and children's 
clothes, the lighter trades that developed after 1900, became the province of 
Jewish immigrant women." Puerto Rican immigrant women also became 
concentrated in the garment industry (Sanchez-Korrol 1983). Immigrant 
women worked either in factories or as homeworkers. The availability of an 
immigrant labor force— poor, industrious, and lacking in other skills—made 
the development of a new system of production possible. Waldinger 
(1986:51) specified that the industry adapted itself to the newcomers by 
having them work for smaller subcontractors, who often housed their factories 
in the tenements where the immigrants lived. The subcontractors formed a 
convenient intermediary between the newly arrived working class and the 
established manufacturers, with the subcontractors specializing in recruiting 
and mobilizing labor. Thus *the tendency to divide functions between man- 
ufacturing and contracting has remained a distinctive feature of the industry 
up to this day." This feature has always depended on the availability of 
women immigrants who preferred working in the home in order to care for 
and supervise their children. Women immigrants also played a critical role in 
the achievement of unionization in the industry (Howe 1976). Today, immi- 
grant women newly arrived from Latin America and Asia continue to supply 
the labor for the garment industry (e.g. Pessar 1984, Safa 1984). 

Maria Patricia Fernandez-Kelly & Anna Garcia (in press) compared the 
superficially similar work of Mexican and Cuban women in the Los Angeles 
and Miami garment industries to elucidate the contrasting interaction of 
ethnicity, class, and gender. At stake were two very different processes of 
labor market attachment of immigrant groups and power relations within the 
family. Mexican immigration to the United States is the sustained migration 
of unskilled and semiskilled replacement labor, while the Cuban migration to 
the United States is the migration of skilled Cuban political refugees that led 
to the creation of an ethnic enclave in Miami (Portes & Bach 1985). Thus, 
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Mexican women immigrants worked in the garment industry due to the 
long-term financial need generated by their husbands’ inadequate earnings, or 
the total loss of male support due to illness, death, or abandonment that turned 
these women into heads of households. Hence, their work in the garment 
industry has been the imperative posed by survival. By contrast, Cuban 
women immigrants worked in the garment industry as a transitory experience 
aimed at recovering the family's lost middle-class level of living by helping 
their husbands become self-employed ir business. Different types of male- 
female power relations in the family were also implicated. Fernandez-Kelly 
and Garcia emphasized that the disillusionment that Mexican women ex- 
perienced with men could lead them to aspire to personal fulfillment and 
independence. However, those aspirations could only be thwarted by their 
poverty. In the case of Cuban women, once their family's aspirations for a 
middle-class level of living was achieved, men pressured their wives to stop 
working outside the home. Husbands had only allowed their wives to have a 
job out of necessity. While the study does not take into account that the 
experience of work perforce changes those traditional definitions of gender 
and family roles (cf Pessar 1986), it does underline the necessity to apprehend 
how ethnicity, class, and gender interact. 

Another comparative study of women immigrants in the garment industry is 
Louise Lamphere's (1987) historical comparison, From Working Daughters . 
to Working Mothers. At the turn of the century, most employed women were 
young, unmarried, lived at home, and their wages helped support working- 
class families. In the Central Falls, Rhode Island, town on which she focused 
her research, the girls were from French Canadian, Irish, English, Scottish, 
and Polish backgrounds and worked in textile mills. In recent years, most 
employed women have been married with small children, and came from 
Portugal and Colombia. Thus, the decline of the female labor force from one 
composed of working daughters to one composed of working mothers was 
intertwined with the history of immigration as well as with the transformation 
of industrial manufacturing over time. Throughout, Lamphere gives equal 
attention to women's lives in the textile mill (productive labor) and at home 
(reproductive labor), and to their strategies of resistance, accommodation, 
and consent. 


ETHNIC ENTERPRISE Immigrant women also make an invaluable contribu- 
tion to the development of immigrant enterprises. Ethnic enterprise—the 
occupational concentration of certain immigrant groups in small-business— 
describes the historical immigrant experience, such as that of Jews and 
Chinese in the early part of the century. Today, Koreans are a clear case of the 
same process. Self-employment bears a relationship to the incidence of 
poverty: Groups that exhibited high rates of self-employment also exhibited 
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low rates of poverty, of which the Jewish-American experience is the pro- 
totype. 

Ivan Light (1979, 1984) has underscored the immigrant nature of ethnic 
enterprise by suggesting that the disadvantages immigrants face in competing 
in the new labor market (due to their inability to speak the language, foreign 
credentials, or racial discrimination) may push them into self-employment. 
Yet, as Light (1984:198—99) himself acknowledged, disadvantage cannot be 
the whole explanation of this phenomenon, because the foreign-born have 
higher rates of self-employment than do most disadvantaged ethnic minori- 
ties, especially blacks. What accounted for this disparity, Light concluded, 
was that immigrants were not simply disadvantaged, but had shared the 
common experience of migration, which in turn produced a reactive group 
solidarity that did not exist prior to migration. Thus, immigrants who sought 
out self-employment in reaction to what Aldrich et al called "occupational 
closure" (1984) could draw on the resource of access to cheap family and 
ethnic labor. The unpaid family labor donated by women is what allows 
immigrants to amass profits and turn them into savings that are reinvested in 
the development and growth of family businesses. Thus, women's contribu- 
tion is the key to the success of these enterprises and to the achievement of the 
petit bourgeois class position (Phizacklea 1983). 


"BRAIN DRAIN" Despite the growing importance today of flows of skilled 
and highly educated immigrants that enter the primary labor market (cf Portes 
1981), this area of immigration studies is underresearched. Thus, although 
women figure importantly as technicians, teachers, doctors, and nurses, most 
studies to date only chronicle the number of women exiting different countries 
(e.g. Mejia et al 1979). Eui Han Shin & Kyung-Sup Chang's (1988) study of 
how Korean immigrant physicians are articulated into the American medical 
profession found that women physicians were much more likely to immigrate 
to the United States than men. Moreover, while all immigrant physicians were 
more likely to enter the peripheral specialties of American medicine, gender 
contributed significantly to that peripheralization. 


The Public and the Private 


WORK AND FAMILY As a result of the recent incorporation of women into 
the field of history, a shift has taken place from writing histories that were 
only about the public sphere to histories that are also about the private sphere 
and the relationship between public and private. Research on immigrant 
women, therefore, needs to chronicle both the private world of immigrant 
women and their community, and the contribution immigrant women made to 
the private sphere for native middle-class women and their families. A good 
example of the former is Sydney Weinberg's (1988) The World of Our 
Mothers, so deliberately a corrective to Irving Howe's (1976) The World of 
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Our Fathers on the Jewish immigrant experience in New York city at the turn 
of the century.:A good example of the latter is Matthews’ (1987) analysis of 
Just a Housewife: The Rise and Fall of Domesticity in America. 

The lives of Jewish immigrant women, centered on the domestic sphere, 
differed from the lives of men, defined by work and the synagogue. Weinberg 
took the oral histories of 46 of these women to portray their daily lives. Piety 
played an important part, with ritual and ceremony marking their daily actions 
as they kept an Orthodox home. Being a Talmudic scholar was not an option 
open to women, but women presided over the domestic religion. Education 
was also highly valued, and a good woman normatively made sacrifices to 
help her husband and sons become scholars. Indeed, contrary to other cultural 
traditions, it was not unusual for women to work outside the home as garment 
workers, or as landladies, or as helpers with the family business for their 
families’ benefit. Moreover, immigrant women played a mediating role be- 
tween the old world and the new. Immigration exposed daughters to the ways 
of a modern, secular world they were eager to accept. Although mothers 
themselves clung to traditional, Orthodox ways, within the family these 
women played the role of mediators between fathers and daughters. As 
Weinberg (1988:148) emphasized, we might not easily understand today the 
satisfactions of those who lived for and through others, but the services and 
sacrifices of these mothers left a deep impression on their daughters. 


DOMESTICITY The labor that immigrant women supplied as servants con- 
tributed to the changing role of the housewife in America. Matthews (1987) 
pointed out that the cult of domesticity arose in the early to mid-nineteenth 
century among middle class women because the availability of domestic 
servants allowed time for the development of the arts of baking and needle- 
work. Time became moré abundant also because of changing attitudes about 
the allocation of tasks between mistress and maid. While, historically, women 
had relied on other women as "help," and worked side by side with them on 
domestic chores, from the 1820s on, "domestic servants" that required 
supervision replaced the "help"—a change that was facilitated by the increas- 
ing number of poor immigrant women coming to America. However, by the 
late nineteenth century a "servant problem" developed because by this time 
servants tended to be drawn from the ranks of the lowly and despised, 
especially the Irish. In the South, of course, servants were black. Matthews 
documents that the relationship between mistress and maid changed when 
ethnic and religious differences became so much more marked than was the 
case when migrant farm girls provided the domestic help. 


MARITAL SATISFACTION The seasonal migration that some women were 
involved in constantly exposed them to "double lives" in two societies and 
cultures. Sylvia Guendelman & Auristela Perez-Itriago (1987:250) found that 
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the impact of migration was experienced very differently by working and 
nonworking women since work had "particularly strong repercussions in 
aligning marital relationships." For example, immigrant Mexican women 
who worked outside the home for wages tended to establish cooperative roles 
with their husbands, sharing power, decision-making, and activities, whereas 
women who did not work became increasingly dependent on their husbands to 
help them negotiate the host society. However, when the working women 
who had established cooperative relationships returned to Mexico, a shift took 
place from cooperative to separate roles as their husbands regained the 
dominant position in the family's relationships with the outside world and 
became uninvolved in childrearing and domestic duties. Thus, the distance 
between spouses increased. By contrast, when the nonworking women who 
had grown increasingly dependent on their husbands in the United States 
returned to Mexico, they experienced improved psychological health and 
reported liberating feelings of release. Guendelman and Perez-Itriago astutely 
utilized seasonal migration as the underpinning of their research design to 
show the impact of work on both marital roles and satisfaction. 


RELIGIOSITY The deeply felt needs of immigrant women also found expres- 
sion in their popular religious tradition. In his analysis of the devotion Italian 
immigrants poured onto The Madonna of 115th Street, Robert Orsi 
(1985:204—5) underscored that while the Madonna came from Italy with the 
immigrants, and as such was a symbol to all Italian immigrants of nation, 
history, and tradition, above all she was a woman's devotion—both because 
women were its main participants and because "it emerged out of and 
reflected the special role and position of women in Italian culture." In Italian 
Harlem, the Madonna also became an expression of the lives of immigrant 
women as these women turned to the Madonna with petitions for help with the 
hardship and powerlessness of their lives—as women bound by a strong, 
patriarchal tradition, and as immigrants mired in poverty, toil, and trouble. 
That private relation became public at the annual festa, when both men and 
women participated as a community, that served to regenerate their culture as 
Italians and to console them for the physical and spiritual trials of immigra- 
tion. 


MENTAL HEALTH The rupture, separation, and loss that is part of any 
migration affects the mental health of all immigrants, but women experience 
it differently from men. As Rogler et al (1987) formulated it, migration 
induces deep and continuous strains that come from the difficulties encoun- 
tered in entering a new economic system and culture, and changing one's 
personal ties. To this, Vega et al (1987) and Rumbaut (1989) added the 
circumstances of leaving one's country of origin and the trauma of the passage 
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itself. These strains are reflected in the immigrants’ mental health— 
psychological distress or depression. Between the strains migration induces 
and the distress immigrants feel are life-event changes (e.g. coping styles, 
social support networks) that shape the immigrants' adaptation. Mental health 
research tends to be quantitative with several standard scales in wide use (e.g. 
the CES-Depression Scale). In her study of Mexican immigrant women, V. 

Nelly Salgado de Snyder (1987) developed an Acculturative Stress scale that 
showed the much higher stress women experienced than men. 

The first in-depth empirical study of the refugee experience is Rumbaut's 
(1989) study of 750 Indochinese refugees (Vietnamese, Khmer, Hmong, 
Chinese-Vietnamese, Laotian). Rumbaut focused on the different effects the 
refugee experience had on women and men (see also Kay 1988). For ex- 
ample, for men the death of close family members significantly predicted 
their depression, while for women it was separation from close family mem- 
bers, especially if these had been imprisoned. For women, having relatives 
living in the house decreased their depression, underscoring the buffering 
effects of family support, while for men it was being married and having close 
Indochinese friends. Moreover, women were found to play a pivotal role in 
the refugee family, influencing the economic and psychological adaptation of 
all its members. 


SUMMARY AND CONCLUSIONS 


As we have shown, gender plays a central role in the decision to migrate and 
the composition of the migration flows, with the consequences that composi- 
tion holds for the subsequent form of immigrant incorporation. The experi- 
ence of immigration also profoundly impacts the public and private lives of 
women-—their labor force participation, their occupational concentration, 
their religiosity, their marital roles and satisfaction, and their autonomy and 
self-esteem. Hence, we can see that the experience of immigration holds 
different benefits for women than for men (cf Kats 1982). Diner's portrayal of 
Irish women clearly leads to the conclusion that the experience of immigration 
was quite beneficial for these women, making us see the Irish migration as 
more "successful" than otherwise. More often than not, women realized their 
ambitions and aspirations and had better opportunities than the men. They 
became educated earlier and with more gusto, and by and large they "Amer- 
icanized" more thoroughly and more enthusiastically. Nancy Foner's (1978) 
study of Jamaican women in London also noted that, difficult as the experi- 
ence of immigration was, it was often far more positive for women than for 
men, as it allowed women to break with traditional roles and patterns of 
dependence and assert a new-found (if meager) freedom. Patricia Pessar's 
(1984) study of Dominican women immigrants suffers from too small a 
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sample size but is suggestive. Pessar notes that when women who had 
previously not worked in the Dominican Republic went for the first time to 
work outside the home in the United States, this change had other important 
effects. Patriarchal roles in the household were transformed, the women's 
self-esteem was heightened, their capacity to participate as equals in house- 
hold decision-making was enhanced, and they secured more income with 
which to actualize their roles. However, the employment did not provide 
women with a new status as working women that challenged or subordinated 
their primary identities as wives and mothers. Rather, it often reinforced these 
very identities as it allowed women to redefine them in a more satisfying 
manner than prior to the migration. 

Throughout this review I have highlighted the recurrent problem of, on the 
one hand, highly suggestive studies that rely on small, often unrepresentative 
samples, rendering interesting ideas and conclusions open to question, and, 
on the other hand, studies where the sample sizes are large and representative 
and their findings, therefore, persuasive, but which treat gender as a variable, 
rather than as a central organizing principle. This is a methodological problem 
that the social sciences need to confront head on. Perhaps immigration 
research, due to its intrinsically interdisciplinary nature, is the arena in which 
to do so. 


ACKNOWLEDGMENTS 


The author expresses her gratitude to Alejandro Portes, Ruben Rumbaut, 
Anne Gere, Fatma Muge Gocek, and Eliana Moya-Raggio for their comments 
and suggestions, and to the University of Michigan's Summer Research 
Opportunity Program and Maria Barboza for her technical assistance. 


Literature Cited 


Aldrich, H. E., Jones, T. P., McEvoy, D. 
1984. Ethnic Advantage and Minority 
Business Development. In Ethnic Commun- 
ities in Business, ed. R. Ward, R. Jen- 
kins, pp. 189-210. Cambridge Univ. Press. 
270 pp. f 

Bodnar, J. 1985. The Transplanted: A History 
of Immigrants in Urban America. Bloom- 
ington, Ind: Ind. Univ. Press. 294 pp. 

Boyd, M. 1989. Family and personal networks 
in international migration: recent develop- 
ments and new agendas. Int. Migr. Rev. 
23:638—70 

Boyd, M. 1984. At a disadvantage: the occu- 
pational attainments of foreign born women 
in Canada. Int. Migr. Rev. 18:1091-119 

Brettell, C. B. 1988. Emigration and house- 
hold structure in a Portuguese parish, 1850— 
1920. J. Fam. Hist. 13:33-57 


Burawoy, M. 1976. The functions and repro- 
duction of migrant labor: comparative mate- 
rial from Souther Africa and the United 
States. Am. J. Sociol. 81:1050-87 

Castells, M. 1975. Immigrant workers and 
class struggles in advanced capitalism: the 
western European experience. Polit. Soc. 
5:33-66 

Diner, H. R. 1983. Erin's Daughters in Amer- 
ica: Irish Immigrant Women in the Nine- 
teenth Century. Baltimore, Md: Johns Hop- 
kins Press. 192 pp. 

Donato, K. M., Tyree, A. 1986. Family reuni- 
fication, health professionals, and the sex 
composition of immigrants to the United 
States. Sociol. Soc. Res. 70:226-30 

Engel, B. A. 1986. The woman's side: male 
out-migration and the family economy in 
Kostroma Province. Slavic Rev. 45:257-71 


Evans, M. D. R. 1984. Immigrant women in 
Australia: resources, family, and work. Int. 
Migr. Rev. 18:1063-90 

Feagin, J. R. 1978. Racial and Ethnic Rela- 
tions. Englewood Cliffs, NJ: Prentice Hall. 
392 pp. 

Fernandez-Kelly, M. P., Garcia, A. 1991. 
Power surrendered, power restored: the 
politics of home and work among Hispanic 
women in Southern California and Southern 
Florida. In Women, Change, and Politics, 
ed. L. A. Tilly, P. Gurin. New York: Rus- 
sell Sage Found. In press 

Fernandez-Kelly, M. P. 1983. Mexican border 
industrialization, female labor force partici- 
pation and migration. In Women, Men, and 
the International Division of Labor, ed. J. 
Nash, M. P. Fernandez-Kelly, pp. 205-23. 
Albany, NY: SUNY Press. 463 pp. 

Ferree, M. M. 1979. Employment without 
liberation: Cuban women in the United 
States. Soc. Sci. Q. 60:35-50 

Foner, N. 1978. Jamaica Farewell: Jamaican 
Migrants in London. Berkeley: Univ. Calif. 
Press. 262 pp. 

Gabaccia, D. 1989. America's immigrant 
women: a review essay. J. Am. Ethnic Hist. 
8:127-33 

Gabaccia, D. 1987. In the shadows of the 
periphery: Italian women in the 19th cen- 
tury. In Connecting Spheres: Women in the 
Western World, 1500 to the Present, ed. J. 
Quataert, M. Boxer, pp. 166-75. New 
York: Oxford Univ. Press. 281 pp. 

Garcia-Castro, M. 1986. Work versus life: 
Columbian women in New York. In Women 
and Change in Latin America, ed. J. Nash, 
H. Safa. South Hadley, Mass: Bergin & 
Garvey 

Glenn, E. N. 1986. Issei, Nisei, Warbride: 
Three Generations of Japanese American 
Women in Domestic Service. Philadelphia, 
Penn: Temple Univ. Press. 290 pp. 

Grasmuck, S., Pessar, P. R. 1991. Between 
Two Islands: Dominican International Mi- 
gration. Berkeley: Univ. Calif. Press. In 
press 

Guendelman, S., Perez-Itriago, A. 1987. 
Double lives: the changing role of women in 
one migration. Women's Stud. 13:249— 


Gugler, J. 1989. Women stay on the farm no 
more: changing patterns of rural-urban 
migration in sub-Saharan Africa. J. Mod. 
Afr. Stud. 27:347-52 

Hojman, D. 1989. "Land reform, female 
migration and the market for domestic ser- 
ut in Chile." J. Lat. Am. Stud. 21:105- 


2 
Hondagneu-Sotelo, P. 1991. Family and com- 
munity in the migration of Mexican un- 
documented immigrant women. In Ethnic 
Women: A Multiple Status Reality, ed. M. 


WOMEN AND MIGRATION 323 


T. Segal, V. Demos. Dix Hills, NY: Gener- 
al Hall. In press. 

Houstoun, M. F., Kramer R. G., Barrett, J. 
M. 1984. Female predominance of im- 
migration to the United States since 1930: a 
first look. Int. Migr. Rev. 18:908-63 

Howe, I. 1976. World of Our Fathers. New 
York: Simon & Schuster. 714 pp. 

Jackson, P. 1984. Women in 19th century Irish 
emigration. Int. Migr. Rev. 19:1004-20 

Kats, R. 1982. The immigrant woman: double 
cost or relative improvement? Int. Migr. 
Rev. 16:661—77 

Kay, D. 1988. The politics of gender in exile. 
Sociology 22:1-21 

Kessner, T. 1977. The Golden Door: Italian 
and Jewish Immigrant Mobility in New York 
City, 1880-1915. New York: Oxford Univ. 
Press. 224 pp. 

Khoo, S. E., Smith, P. C., Fawcett, J. T. 
1984. Migration of women to cities: the 
Asian situation in comparative perspective. 
Int. Migr. Rev. 18:1247-63 

Lamphere, L. 1987. From Working Daughters 
to Working Mothers: Immigrant Women in a 
New England Industrial Community. Ithaca, 
NY: Comell Univ. Press. 390 pp. 

Lee, E. S. 1966. A Theory of Migration. De- 
mography 3:41-57 

Lee, S. M. 1989. Female immigrants and 
labor in colonial Malaya: 1860-1974. Int. 
Migr. Rev. 23:309-31 

Lichter, D. T. 1983. Socioeconomic returns to 
migration among married women. Soc. 
Forces 62:487--503 

Light, I. 1984. Immigrant and ethnic enter- 
prise in North America. Ethnic Racial Stud. 
7:195-216 

Light, I. 1979. Disadvantaged minorities in 
self-employment. Int. J. Comp. Sociol. 
20:31-45 

Lindstrom-Best, V. 1988. Defiant Sisters: A 
Social History of Finnish Immigrant Women 
in Canada. Toronto: Multicult. Hist. Soci- 
ety Ontario. 205 pp. 

Mason, K. O. 1987. The impact of women's 
social position on fertility. Sociol. Forum 
4:718-45 


Massey, D. S., Alarcon, R., Durand, J., Gon- 
zalez, H. 1987. Return to Aztlan: The Social 
Process of International Migration from 
Western Mexico. Berkeley: Univ. Calif. 
Press. 335 pp. 

Matthews, G. 1987. Just a Housewife: The 
Rise and Fall of Domesticity in America. 
New York: Oxford Univ. Press. 281 pp. 

Maxwell, N. L. 1988. Economic returns to 
migration: marital status and gender differ- 
ences. Soc. Sci. Q. 69:108-21 

Mejia, A., Pizurki, H., Royston, E. 1979. 
Physician and Nurse Migration: Analysis 
and Policy Implications. Geneva: Word 
Health Org. 476 pp. 


324 PEDRAZA 


Morrison, D. R., Lichter, D. T. 1988. Family 
migration and female employment: the 
problem of underemployment among mi- 
grant married women. J. Marriage Fam. 
50:161—72 

Orsi, R. A. 1985. The Madonna of 115th 
Street: Faith and Community in Italian Har- 
lem, 1880-1950. New Haven, Conn: Yale 
Univ. Press. 287 pp. 

Parr, J. 1987. The skilled emigrant and her 
kin: gender, culture, and labour recruit- 
ment. Can. Hist. Rev. 68:529-51 

Pedraza-Bailey, S. 1990. Immigration re- 
Search: a conceptual map. Soc. Sci. Hist. 
14:43-67 

Pedraza-Bailey, S. 1985. Political and Eco- 
nomic Migrants in America: Cubans and 
Mexicans. Austin: Univ. Tex. Press. 242 


pp. 

Perez, L. 1988. Cuban women in the U.S. 
labor force: a comment. Cuban Stud. 18: 
159-64 

Perez, L. 1986. Immigrant economic adjust- 
ment and family organization: the Cuban 
success story reexamined. Int. Migr. Rev. 
20:4~20 

Pessar, P. R. 1984. The linkage between 
the household and workplace in the experi- 
ence of Dominican immigrant women in 
we United States. Int. Mig. Rev. 18:1188- 

l1 

Pessar, P. R. 1986. The role of gender in 
Dominican settlement in the United States. 
In Women and Change in Latin America, 
ed. J. Nash, H. Safa. South Hadley, Mass: 
Bergin & Garvey 

Phizacklea, A., ed. 1983. One Way Ticket: 
Migration and Female Labour. Boston: 
Routledge & Kegan Paul. 162 pp. 

Piore, M. 1979. Birds of Passage. New York: 
Cambridge Univ. Press. 229 pp. 

Portes, A., Bach, R. L. 1985. Latin Journey: 
Cuban and Mexican immigrants in the 
United States. Berkeley: Univ. Calif. Press. 


pp. 

Portes, A. 1981. Modes of structural in- 
corporation and present theories of labor 
immigration. In Global Trends in Migra- 
tion: Theory and Research on International 
Population Movements, ed. M. M. Kritz, 
C. B. Keely, S. Tomasi, pp. 279-97. New 
York: Ctr. Migr. Stud. 433 pp. 

Portes, A. 1978. Migration and Underdevel- 
opment. Polit. Soc. 8:1—48 

Prieto, Y. 1987. Cuban women in the U.S. 
labor force: perspectives on the nature of the 
change. Cuban Stud. 17:73-94 

Reiff, J. L., Dahlin, M. R., Smith, D. S. 
1983. Rural push and urban pull: work and 
family experiences of older black women in 
southern cities, 1880-1900. J. Social Hist. 
16:39-48 

Reimers, C. 1985. Cultural differences in 


labor force participation among married 
women. Am. Econ. Rev. 75:251-55 

Rogler, L. H., Gurak, D., Santana-Cooney, 
R. 1987. The migration experience and 
mental health: formulations relevant to His- 
panics and other immigrants. In Health and 
Behavior: Research Agenda for Hispanics, 
ed. M. Gaviria, J. D. Arana, pp. 72-84. 
Chicago: Univ. Ill. Press. 294 pp. 

Rose, P. I. 1981. Some thoughts about ref- 
ugees and the descendants of Theseus. Int. 
Migr. Rev. 15:8-15 

Rumbaut, R. G. 1989. Portraits, patterns, and 
predictors of the refugee attainment process: 
results and reflections from the IHARP Pan- 
el Study. In Refugees as Immigrants: Cam- 
bodians, Laotians, and Vietnamese in 
America, ed. D. W. Haines, pp. 138-82. 
Totowa, NJ: Rowman & Littlefield. 198 pp. 

Safa, H. 1. 1984. Female employment and the 
social reproduction of the Puerto Rican 
working class. Int. Migr. Rev. 18:1168- 
87 

Salgado de Snyder, V. N. 1987. Factors 
associated with acculturative stress and de- 
pressive symptomatology among married 
Mexican immigrant women. Psychol. 
Women Q. 11:475-88 

Sanchez-Korrol, V. E. 1983. From Colonia to 
Community: The History of Puerto Ricans 
in New York City 1917-1948. Westport, 
Conn: Greenwood Press. 242 pp. 

Santana-Cooney, R., Ortiz, V. 1983. Nativ- 
ity, national origin, and Hispanic female 
participation in the labor force. Soc. Sci. Q. 
64:510-23 

Sassen-Koob, S. 1984. Notes on the incorpor- 
ation of third world women into wage-labor 
through immigration and off-shore produc- 
tion. Int. Migr. Rev. 18:1144-67 

Seller, M. S. 1975. Beyond the stereotype: A 
new look at the immigrant woman. J. Eth- 
nic Stud. 3:59-68 

Shin, E. H., Chang, K.-P. 1988. Peripheriza- 
tion of immigrant professionals: Korean 
physicians in the United States. Int. Migr. 
Rev. 22:609-26 

Simon, R. J., Brettell, C. B., eds. 1986. In- 
ternational Migration: the Female Experi- 
ence. Totowa, NJ: Rowman & Allanheld. 
310 pp. 

Stacey, J., Thorne, B. 
feminist revolution 
Probl. 32:301-15 

Stein, B. N. 1981. The refugee experience: 
defining the parameters of a field of study. 
Int. Migr. Rev. 15:320-30 

Sullivan, T. A. 1984. The occupational pres- 
tige of women immigrants: a comparison of 
Cubans and Mexicans. Int. Migr. Rev. 
18:1045-62 

Tienda, M., Jensen, L., Bach, R. L. 1984. 
Immigration, gender, and the process of 


1985. The missing 
in sociology. Soc. 


occupational change in the United States, 
1970-80. Int. Migr. Rev. 18:1021-44 
Tilly, L. A. 1989. Gender, women's history, 
and social history. Soc. Sci. Hist. 13:439- 
2 


6 

Tyree, A., Donato, K. 1986. A demographic 
overview of the international migration of 
women. In International Migration: The 
Female Experience, ed. R. J. Simon, C. B. 
Brettell. Totowa, NJ: Rowman & Allan- 
held. 310 pp. 

Vega, W. A., Kolody, B., Valle, J. R. 1987. 
Migration and mental health: an empirical 
test of depression risk factors among immi- 
grant Mexican women. Int. Migr. Rev. 
21:512-30 

Waldinger, R. 1986. Through the Eye of the 
Needle: Immigrants and Enterprise in New 
York's Garment Trades. New York: New 
York Univ. Press. 231 pp. 

Watts, S. J. 1983. Marriage migration, a ne- 
glected form of long-term mobility: a case 
study from Ilorin, Nigeria. Int. Migr. Rev. 
17:682-98 


WOMEN AND MIGRATION 325 


Weatherford, D. 1986. Foreign and Female: 
Immigrant Women in America, 1840-1930. 
New York: Schocken. 288 pp. 

Weinberg, S. S. 1991. The treatment of 
women in immigration history: a call for 
change. In Seeking Common Ground: Mul- 
ti-Disciplinary Perspectives on Immigrant 
Women, ed. D. Gabaccia. Wesport, Conn: 
Greenwood Press. In press 

Weinberg, S. S. 1988. The Worid of Our 

` Mothers: The Lives of Jewish Immigrant 
Women. Chapel Hill: Univ. N. Carolina 
Press. 325 pp. 

Wilkinson, R. C. 1983. Migration in Lesotho: 
some comparative aspects, with particular 
reference to the role of women. Geography 
68:208—24 

Yanagisako, S. J., Collier, J. F. 1987. Toward 
a unified analysis of gender and kinship. 
In Gender and Kinship: Essays Toward a 
Unified Analysis, ed. J. F. Collier, S. J. 
Yanagisako. Stanford: Stanford Univ. 
Press. 369 pp. 


Annu. Rev. Sociol. 1991. 17:327-50 
Copyright © 1991 by Annual Reviews Inc. All rights reserved 


POLICY DOMAINS: Organization, 
Culture, and Policy Outcomes 


Paul Burstein 
Department of Sociology, University of Washington, Seattle, Washington 98195 


KEY WORDS: policy domains, agenda setting, policy formulation, legislation, organizations, 
culture 


Abstract 


Sociologists interested in politics have increasingly turned in recent years to 
the study of policy domains—components of the political system organized 
around substantive issues. This review focuses on the process leading to 
legislative enactment of policy change and assesses issues and findings in 
three aspects of the political process:. agenda setting, the development of 
policy proposals, and the struggle for adoption of particular proposals. Quite a 
bit is known about adoption of proposals, but relatively little work has been 
done on agenda setting, and the task of understanding the development of 
policy proposals has barely begun. Policy change is affected most directly by 
formal organizations whose activities are channeled and given meaning by 
culture; government organizations play an active role in formulating policy 
and deciding how it will be implemented as well. 


INTRODUCTION 


Traditionally sociologists interested in politics analyzed the rise and fall of 
democratic government, the society-wide distribution of power, and individu- 
al political participation and party choice (Bendix & Lipset 1966, Burstein 
1988, Lipset & Rokkan 1967). In the last few years, however, sociologists 
have focused increasingly on public policy and how it is affected by private 
and public organizations, including corporations, interest groups, social 
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movement organizations, and government agencies. In a phrase, sociologists 
have begun to analyze policy domains. This review presents a framework for 
analyzing policy domains; it assesses recent work on the determinants of 
policy within domains, considering changes in the political agenda, the 
development of policy options, and legislative enactment; and it describes 
how policy domains are linked to the larger political system. 


POLICY DOMAINS: A FRAMEWORK FOR ANALYSIS 


A policy domain is a component of the political system that is organized 
around substantive issues. [Other terms used with the same meaning include 
*policy areas" (Amenta & Carruthers 1988:666), "sectors" (Scott & Meyer 
1983:137; Freeman 1985), "subsystems" (Freeman 1985), "dimensions" 
(Budge & Hofferbert 1990:114), “issue domains” (McDonagh 1989:121), and 
“programs” (Rose 1985:9)]. Those studying policy domains define them in 
terms of three sets of characteristics. The first and most obvious is substantive 
or functional. The issues that define a domain are seen as sharing inherent 
substantive characteristics which influence how they are framed and dealt 
with. Domains such as energy, health, transportation, or agriculture, for 
example, arguably have a certain logic and coherence, and most specific 
issues fit relatively unambiguously into these or other domains (see the 
discussion in Scott & Meyer 1983, cf. Jankowski 1988, Kingdon 1984, 
Salisbury et al 1987). 

Recently, sociologists and political scientists have begun to place less 
emphasis on the “inherent” qualities of policy domains. Instead, they argue 
that policy domains are largely socially constructed by those active in politics. 
Laumann & Knoke (1987:10) emphasize the organizational basis of such 
social construction. They define a national policy domain as a set of organiza- 
tions concerned about a set of substantive problems, which take each other 
into account as they formulate policy options and work for their adoption (see 
also Rose 1985, Sabatier 1988, Salisbury et al 1987, Scott & Meyer 1983). 

Others focus on the cultural basis of policy domains. When organizations 
define particular conditions as problems, develop policy options, and decide 
which other organizations to deal with, they are strongly influenced by 
cultural theories about how society works. Domains are cultural constructs 
around which organizations and individuals orient their actions (Cobb & Elder 
1983, Edelman 1988, Gamson & Modigliani 1987, 1989, Gusfield 1981, 
Meyer et al 1987:12, Stone 1989). 

As a practical matter, the substantive, organizational, and cultural ap- 
proaches produce similar results. Laumann & Knoke (1987), for example, 
delineate the domains they study organizationally, but their focus on health 
and energy policy seems reasonable from a substantive point of view. Never- 
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theless, those who adopt a social-construction approach to domains (e.g. 
Gusfield 1981, Laumann & Knoke 1987, cf Kingdon 1990) are often quite 
dubious about claims that issues have "inherent" substantive characteristics 
that influence the policy process. Few sociologists would argue that there is 
no inherent substantive basis to policy domains at all; but many would suggest 
that the substantive base can be manifested in a variety of ways, as individuals 
and organizations try to redefine issues and build new coalitions in ways that 
will benefit them. 

Many sociologists have analyzed policy domains in recent years. Studies 
include those by Amenta & Carruthers (1988), Pampel & Williamson (1985), 
and Quadagno (1989) on old age pensions; Amenta et al (1987) on unemploy- 
ment insurance; Berk et al (1977) on criminal justice; Boli (1987) on human 
rights; Boli & Meyer (1987) on policies pertaining to childhood; Burstein 
(1985, Burstein & Freudenburg 1978) on equal employment opportunity and 
foreign policy; Gamson & Modigliani (1987, 1989) on affirmative action and 
nuclear power; Garnier et al (1989) and Ramirez & Boli (1987) on education; 
Gusfield (1981) on drunken driving; Hooks (1990) on agriculture; Jenkins & 
Brents on welfare policy (1989); Knoke & Pappi (1989), Steinberg (1982), 
and Wilensky & Turner (1987) on labor; Laumann & Knoke (1987) on energy 
and health; Korpi (1989) on social rights; McAdam (1982) on civil rights; and 
Orloff & Skocpol (1984) on the welfare state. 

This increasing focus on policy domains seems to have occurred for two 
main reasons. First, the state controls a huge proportion of societal resources, 
and its policies affect everyone in important ways. Sociological theories of 
politics traditionally paid little attention to policy, but in the face of continu- 
ing state expansion and ongoing struggles over policy, sociologists came to 
base their research agenda a bit less on theory and a bit more on common 
sense, turning increasingly to the study of policy (see Burstein 1981, March & 

"Olsen 1989:ch. 1). 

Second, having decided to focus more on policy, sociologists began to alter 
their thinking about how to explain policy decisions. Those initiating new 
approaches to the study of the state (e.g. Skocpol & Amenta 1986), social 
movements (e.g. Gamson 1975, McCarthy & Zald 1977), and organizations 
(e.g. Meyer & Scott 1983) all serendipitously arrived at a conclusion pre- 
viously reached by many political scientists: that politics proceeds primarily 
in numerous relatively self-contained policy domains, each operating more or 
less autonomously with its own issues, actors, and processes (Freeman 1985). 
State action proceeds primarily through agencies organized to implement 
specific policies (Pampel & Williamson 1988, Rose 1985); social movement 
organizations typically seek changes in particular policies (Gamson 1975); 
and the complexity of modern life forces organizations and individuals to 
focus on a small set of policies about which they are especially concerned (cf 
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March & Olson 1989:ch. 9, Bosso 1987, Clausen 1973). To explain policy 
outcomes, it is therefore necessary to focus on policy domains. 

The conclusion that politics proceeds within relatively self-contained policy 
domains has two important implications for the analysis of policy change. 
First, those analyzing policy domains either hypothesize or assume that the 
domains are characterized by "causal autonomy:" Policy outcomes are 
affected primarily by forces within each domain (see, e.g., Freeman 1985, 
Hansen 1987, Sabatier 1988, Pampel & Williamson 1985). 

Second, assuming that policy processes within domains are indeed causally 
autonomous, then general society-wide conditions (such as the level of eco- 
nomic development or the relative strength of different political parties) will 
have little direct effect on policy (though they may constrain policy de- 
velopments or affect them indirectly by influencing organizational or cultural 
developments within domains). This is because policy change is determined 
directly by forces within each domain; no single factor (e.g. class relations) 
underlies policy change across domains, nor is there a unified elite controlling 
most or all domains (e.g. Laumann & Knoke 1987:377, Wilson 1980:ch. 10). 

Recent research does show consistently that within-domain forces are 
critical determinants of policy (e.g. Amenta et al 1987, Jacob 1988). General 
societal conditions are not very important (Amenta & Carruthers 1988) and 
would probably be found even less so in properly specified models (Jacob 
1988, Laumann & Knoke 1987, March & Olsen 1989:ch. 9, Wilensky & 
Turner 1987). 

Traditionally, those studying policy change devoted most of their effort to 
explaining the adoption of legislation. Beginning in the mid-1980s, however, 
some political scientists began arguing that the focus on adoption led those 
studying policy to miss critical earlier stages of the policy process, what 
Polsby called (1984:3) “the politics of inventing, winnowing, and finding and 
gaining adherents for policy alternatives . . . before moving alternatives from 
unlikely to possible to probable candidates for inclusion 'on an agenda for 
enactment” (cf Cobb & Elder 1983). Sociologists have begun to reach the 
same conclusion (e.g. Jenkins & Brents 1989, Pavalko 1989). 

Probably the most influential response to this realization is Kingdon’s, 
which is winning attention among sociologists (see Kingdon 1984, 1990, 
Laumann & Knoke 1987:45). Sharing Polsby’s view of prior work, Kingdon 
developed a model of the policy process based on Cohen, March, & Olsen’s 
(1972) “garbage can model of organizational choice.” He argued (1984:88— 
94) that the policy process is made up of three "streams," each largely 
independent of the others: the stream of problem recognition, in which issues 
get onto the political agenda; the stream of alternatives, in which policy 
proposals are formulated; and the stream of “politics,” in which choices are 
made among proposals. Policy change occurs when the streams come 
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together—"a problem is recognized, a solution is available, [and] the political 
climate makes the time right for change . . .” (1984:93, cf Burk 1985). This 
review considers work in each of the three streams in turn. It does not 
consider administrative or judicial decision-making or implementation in any 
detail (doing so would require an entire additional review; but see, e.g., 
Wilson 1980, Pressman & Wildavsky 1984, and Caldeira & Wright 1988). In 
addition, the focus is predominantly on. studies of American politics. 


PROBLEM RECOGNITION AND AGENDA SETTING 


How do issues come to be considered by a legislature? This is a, question 
about the political agenda, "the list of subjects or problems to which gov- 
ernmental officials, and people outside of government closely associated with 
those officials, are paying some serious attention at any given time" (Kingdon 
1984:3). Three questions have been critical to prior work: Where do public 
policy issues come from? How do some get on the "governmental agenda," 
gaining attention from those in or close to government? And how do a few of 
these reach the "decision agenda," where they are actively considered for 
adoption? (Cobb & Elder 1983:14, Kingdon 1984:4). 


Where Issues Come From 


Most of those analyzing the political agenda agree that issues do not simply 
arise out of objective conditions; rather, they are "continuously constructed 
social phenomen[a]," to use Laumann & Knoke's (1987:15) phrase, created 
by people trying to make sense of their world and deciding how to act. 

Little is known about how this occurs. As Laumann & Knoke write 
(1987:378—79), "only a very limited number of matters are selected for 
dispute and public controversy, the result of poorly understood processes 
whereby key actors come to contest the symbolic framing in which ‘routine’ 
decisions had heretofore been made," or the actors may see a public issue 
where none had been perceived to exist before. Even phenomena that seem 
self-evidently matters of public concern—for example, drunken driving, 
environmental pollution from pesticides, and child abuse—were not always 
viewed as public issues (see Gusfield 1981, Bosso 1987, Nelson 1984, cf 
Hilgartner & Bosk 1988). 

Issue creation is a cultural and social process. For something to become a 
public issue, it first must be defined as a problem amenable to human 
solution. This is no small matter; as Kingdon has written (1984:121), "Getting 
people to see new problems, or to see old problems in one way rather than 
another, is a major conceptual and political accomplishment" (see also Stone 
1989, cf Snow et al 1986). A key element in defining something as a public 
issue seems to be the development of a "causal story" purporting to explain 
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how a group comes to experience harm and to show who is to blame and must 
take responsibility (Stone 1989, cf Gusfield 1981). 

Causal stories have a social base. Stone (1989) claims that groups self- 
consciously organized to generate ideas, particularly scientists (including 
social scientists) and lawyers, are especially likely to develop causal stories 
and win acceptance for them. Their expertise gives their ideas special status, 
and the highly developed professional communications networks available to 
them provide opportunities to disseminate their ideas (cf Jacob 1988, Knoke 
1990, Sabatier 1988, Stryker 1990). 

The number of issues generated depends partly on how many organizations 
have members who see the creation of issues as part of their jobs. Thus, more 
issues will be generated in decentralized political systems where multiple 
legislatures and agencies may encourage their development (Kelman 1981, 
Knoke & Laumann 1982); where social movement organizations and interest 
groups abound (Gais et al 1984); where politicians believe they will benefit 
from creating issues and have the resources to do so (Kelman 1981); where 
political parties develop ideologically based programs (Orloff & Skocpol 
1984); and where governmental and nongovernmental organizations have 
policy intellectuals on their staffs (Orloff & Skocpol 1984). 

Very little is known, however, about how great an impact groups, in- 
dividuals, or institutional structures have on the generation of ideas, or about 
their relative importance. And little is known about the causal stories—why 
they take the form they do or develop for some potential issues but not others 
at a given time. Many researchers believe that dramatic events draw attention 
to issues (Kingdon 1984, Carmines & Stimson 1989), but none has shown 
how to predict which sorts of events lead to the generation of issues and which 
do not. 


Getting on the Governmental Agenda 


Whether an issue gains attention from those in or close to government 
depends on the characteristics of those generating ideas, those in or close to 
government, the relations between them, and the relationship between the 
issue and the broader environment. 

Issues are more likely to gain a hearing if their creators have the resources 
to make their cases credibly, persistently, and in ways seen as potentially 
useful by those in power (Gamson 1975, Jacob 1988, Laumann & Knoke 
1987). Some scholars claim that it has become easier to win attention because 
improvements in communications technology and management techniques 
have increased the number of interest groups and their organizational capaci- 
ties (Gais et al 1984, Schlozman & Tierney 1986, Walker 1983). 

Issues are also more likely to gain a hearing if those in or close to 
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government are accessible and open to new ideas. Accessibility and openness, 
in turn, are affected by organizational factors. Members of Congress are more 
accessible than members of most European parliaments because members of 
Congress are much more responsible for their own careers, and they believe 
that promoting a new, potentially popular issue can improve their prospects. 
Some even instruct staff members to look for ideas in the professional 
literature in areas of interest (Kelman 1987). 

There can be considerable variation among government agencies as well. 
Some engage primarily in routine administration and have little interest in 
research or entrepreneurial activities. Others see themselves as innovators and 
are organized to seek out and implement new ideas (Amenta et-al- 1987, 
Heidenheimer et al 1983, Nordlinger 1981, Wilensky & Turner 1987). 

Because openness varies among governments and agencies, new issues are 
likely to move onto the governmental agenda more easily in decentralized 
political systems with more points of access. In the United States, policy 
innovators often try to get an idea on the agenda in a number of states and at 
the federal level simultaneously, often succeeding in some places while 
failing in others (Burstein 1985, Steinberg 1982). 

Connections between technical experts and legislators or bureaucrats also 
affect the movement of issues onto the governmental agenda. As Amenta et al 
have shown (1987), for example, the establishment of formal, ongoing ties 
between expert commissions and the legislature in some American states led 
to policy innovation. Legislative committees and executive departments in the 
United States have many ways to get advice from scientific experts, including 
holding committee hearings, hiring academics to fill staff positions, and 
commissioning reports from the National Academy of Sciences. Because the 
number and type of such connections will vary over time, between countries, 
and across policy domains, some new issues will gain attention more easily 
than others (Heidenheimer et al 1983, Polsby 1984). 

Legal rules regulating relationships between governmental and non- 
governmental organizations also influence movement onto the governmental 
agenda (Bosso 1987). Tax laws denying exemptions to lobbyists discourage 
attempts to influence Congress, for example, but liberalization of the require- 
ments for legal standing have made it easier for reform groups to get their 
point of view before the courts and, sometimes, administrative agencies. 

The relationship between the issue and the broader environment may affect 
access to the governmental agenda. Some scholars claim that the environment 
may be especially receptive to ideas compatible with those already widely 
accepted. For example, they argue that it was easier for child abuse to get on 
the agenda in the twentieth century than it might have been earlier because 
childhood is increasingly seen as a protected, special status (Nelson 1984, cf 
Boli & Meyer 1987); and health maintenance organizations won support from 


334 BURSTEIN 


policymakers when portrayed as a way to cut costs at a time when cost-cutting 
became a dominant motif of federal policy (Kingdon 1984). 

Recent work suggests that increasing political entrepreneurship, con- 
gressional staff size, and organization by professionals, interest groups, and 
social movements have made it easier for new issues to gain an initial hearing. 
There are some dissenting voices, however. Some believe that growing 
reliance on experts reduces participation by non-experts, whose ideas lose 
legitimacy and are closed out of the political process (Cobb & Elder 
1983:epilogue; cf Weiss 1989). Hilgartner & Bosk have argued (1988) that 
institutions and individuals can deal with only a limited number of issues at 
any particular time. Room for new issues will always be limited, and unless 
the carrying capacity of governmental organizations is increased, the political 
system's increasing openness may merely increase the intensity of the compe- 
tition for attention, rather than actual access to the agenda. 

Substantial progress is being made in developing research methods suitable 
for further testing these hypotheses, identifying useful indicators of influence 
attempts and agenda placement, and describing relevant actors (Gais et al 
1984, Salisbury et al 1987, Schlozman & Tierney 1986, Walker 1983). 
Unfortunately, few hypotheses have been tested systematically in a broad 
sample of policy domains; most work examines either a single domain or a 
single type of organization (such as interest groups; for an exception, see 
Laumann & Knoke 1987:ch. 14). The task of analyzing the struggle to get 
ideas onto the governmental agenda has barely begun. 


The Decision Agenda 


Movement up the agenda will be affected by the same factors that get an issue 
on the governmental agenda initially, and by four additional factors as well. 

The first is the mass media. The media rarely get issues onto the agenda; in 
fact, reporting of an issue often begins in response to congressional hearings 
(Walker 1977, Costain 1988). By bringing an issue to the attention of a broad 
public, however, the media may help move it from committee hearings to 
floor debate. 

The second is the role of the public relative to those professionally con- 
cerned about an issue. Some scholars argue that although organizations and 
experts may get an issue onto the agenda, movement upward depends upon 
pressure from a relatively broad public. Jacob (1988) has challenged this 
view, however, claiming that major legislative changes can occur without 
popular concern, if experts are relatively united and get the issue viewed as a 
narrow, technical one. 

The third factor affecting movement up the agenda is how the issue is 
perceived by those potentially affected by it. Movement upward is more likely 
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(a) if the issue is defined narrowly, so that few groups take an interest and it 
arouses little resistance; (b) if, when there is significant resistance, the mass 
public becomes involved; (c) if it addresses an easily understood problem and 
offers a simple, easily understood solution; and (d) if it appears possible to 
deal with the issue at little cost (Jacob 1988, Nelson 1984, Walker 1977). 

Finally, what Kingdon has called “focusing events" (1984, p. 99), such as 
crises or disasters, are widely seen as important for movement up the agenda 
(cf Hilgartner & Bosk 1988, Nelson 1984, Garrow 1978). Policy-makers 
attribute much greater importance to such events in some policy areas than 
others, however, and they seem irrelevant to much routine policy-making 
(Kingdon 1984, Jacob 1988). More basically, events may have little meaning 
by themselves; they are given meaning by groups utilizing particular in- 
terpretive frameworks and may affect politics only when groups are ideologi- 
cally and organizationally prepared to take advantage of them. 

Taken as a whole, the literature on agenda setting is so small that gaps are 

easier to identify than controversies (Cobb & Elder 1983, Kingdon 1984, 
Nelson 1984, Polsby 1984). The most obvious gap is the lack of a theory 
predicting what kinds of issues will arise and gain attention (Tilly 1975). It is 
always possible to provide a convincing retrospective explanation of why an 
issue arose, but no one knows how to provide a prospective analysis accurate- 
ly predicting which issues will become important and which will not (cf Tilly 
1975). 
- Prior work does contain a number of controversies, often implicit. One 
concerns the impact of objective circumstances on problem recognition. Most 
social scientists agree that issues are not simply created by objective circum- 
stances; how conditions are interpreted is important, and interpretations are 
often subject to dispute. The scholars most committed to a social-construction 
view suggest that how phenomena are dealt with is so much a matter of 
interpretation that no phenomena is a priori more likely than any other to be 
the basis for a political issue (Gusfield 1981). Others, however, while 
acknowledging that interpretation is important, also claim that some objective 
characteristics of issues—such as how many people will be affected, or 
whether economic interests or health are threatened —make some conditions 
more likely than others to become political issues (Walker 1977). 

The source of new ideas is also controversial. Most work assumes that 
policymakers have little time to seek out new ideas, much less generate any, 
and that they therefore pick up on ideas that are readily available (e.g. Kelman 
1987, Polsby 1984, Schneider & Ingram 1988). But where do they get these 
ideas? Some suggest that communities of thought within policy domains are 
so important that ideas spread primarily within domains, even across national 
boundaries (Freeman 1985). Others argue that the boundaries between policy 
domains are more permeable than national boundaries; policymakers and 
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experts in one domain borrow ideas from other domains close to their own 
organizationally and in terms of how ideas are framed (Walker 1977). 

The hypothesis that ideas will be more readily accepted if they are compat- 
ible with ideas already popular seems commonsensical and is widely accepted 
(see, e.g., Kingdon 1984, Nelson 1984). But some disagree. There is no good 
way to gauge the compatibility of one idea with others, so claims about 
compatibility are usually made after-the-fact: acceptance of an idea demon- 
strates its compatibility with those already popular. This is a circular argu- 
ment (Jacob 1988). The hypothesis may be correct, but the evidence for it is 
weaker than most people suppose. 


THE SPECIFICATION OF ALTERNATIVES 


Theory and Measurement 


Long ago Schattschneider argued (1960:68) that “the definition of alternatives 
is the supreme instrument of power." It is difficult to imagine a satisfactory 
analysis of policy change which does not seriously examine the alternatives 
proposed. 

Narrative histories of legislative change nearly always describe how bills 
are amended on the floor, and some trace the development of policy proposals 
farther back (see, e.g., Bosso 1987, Glendon 1989, Harrison 1988). Such 
work is almost entirely descriptive, however. Herbert Simon has noted 
(1985:302) that little systematic research has been done on how alternatives 
are generated; theoretical and statistical work on policy generally ignores the 
content of alternatives altogether. 

A major reason for ignoring how policy proposals are generated and what 
they say is methodological, and the consequences are severe. As Hanson 
argues (1983:771) in a statement referring to welfare policy but applicable 
much more generally, “we have not made substantial theoretical headway in 
trying to understand and explain welfare policymaking . . . largely because of 
our continued reliance on an impoverished conceptualization and measure- 
ment of welfare policy." We cannot explain policy change because we cannot 
systematically describe and measure the most critical variable: policy change 
itself. 

The best way to analyze policy change, and proposals for policy change, is 
content analysis. Content analysis has a long history in the sociological study 
of politics. Ogburn's 1912 PhD dissertation on child labor legislation relied 
on-an early form of it (Ogburn 1912); it was widely used during and after 
World War II (Janowitz 1970:207—-22);-and after a hiatus in the 1960s, it came 
to play a crucial role in the analysis of social-movements, as Gamson (1975), 
Jenkins & Perrow (1977), McAdam (1982), and others utilized it to create 
measures of social and political change. 
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Little of this work utilizes content analysis to describe policy change, 
probably because policy proposals are so complex, and difficult to analyze. A 
few sociologists have used content analysis to analyze policy, however, and 
their work shows that it can help us learn a great deal about policy change. 
These sociologists include Berk, Brackman, and Lesser (1977), who ex- 
amined criminal justice legislation; John Meyer and his collaborators, who 
analyze changes in constitutional provisions defining childhood, human 
rights, and other issues (e.g. Boli 1987, Boli & Meyer 1987); Steinberg, 
whose analysis of labor legislation shows that systematic content analysis can 
lead to descriptions of policy change fundamentally different from those 
found in conventioríal narrative histories (1982); and Burstein, one of the few 
to consider not only alternatives adopted by the legislature, but those consid- 
ered and rejected as well (Burstein 1985, cf Burstein & Freudenberg 1978). 

Other sociologists are becoming aware of the need to content-analyze laws 
and policy proposals (e.g. Jenkins & Brents 1989, Korpi 1989, Pavalko 
1989), but few try formally to describe the content of policy proposals and 
analyze how they are generated. Thus, the content analysis of policy has yet 
to cohere into a recognized literature; those analyzing one policy domain often 
ignore those working in others, and theorists seem unaware of much empirical 
work (see also Polsby 1984, Kingdon 1984, Riker 1986). Methods for 
describing policy alternatives tend to be developed on an ad hoc basis in each 
study, and theoretical treatments, arguably making a virtue out of necessity, 
draw on biological metaphors (Kingdon 1984, cf Carmines & Stimson 1989) 
which permit after-the-fact analyses of alternatives but preclude prediction 
about future developments. 

What is needed are conceptual and methodological advances providing the 
basis for systematic analysis and comparison of public policies (see Duffy 
1989, Hofstetter 1981, Janowitz 1970:207—22). What is available are some 
very general guidelines from past work. 

The most useful analyses of policy proposals are based on systematic 
content analysis, and they meet several criteria. First, the coding is based on 
objective characteristics of the proposals. Coding content along ideological 
dimensions—from "conservative" to "liberal" for example—while common 
(see, e.g. Erikson et al 1989) assumes the meaning of proposals to be fixed 
and obvious, whereas it is a basic premise of the most thoughtful work on 
content analysis that meanings are debatable and may change (see especially 
Steinberg 1982, Duffy 1989, Janowitz 1970:207—22). 

Second, the policy alternatives are analyzed along dimensions permitting 
comparisons over time, across domains, and among polities. An explicit 
theoretical basis is best for defining the dimensions, but if there is none, 
analysts at least rely on their experience in comparing policies to develop 
broadly useful dimensions. Those who propose policy alternatives often draw 
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on approaches used previously in other domains or political systems, and the 
examination of such "borrowing" may lead to the discovery of common 
dimensions underlying a wide range of policy proposals (Schneider & Ingram 
1988, Thomas et al 1987). 

Third, measurement along particular Jaen is specific, but not so 
specific that overall patterns get lost in a welter of technical legal detail. 
Determining the most useful level of specificity is a goal of all those who 
analyze political and legal documents, but thus far doing so is entirely a 
matter of art, depending on the judgment and experience of researchers 
working with little theoretical or methodological guidance (Ratner & Burstein 
1980). 

Finally, analyses are organized so that “roads not taken”—possibilities not 
proposed—are to be identified, showing how a theoretically meaningful set of 
possible proposals is related to the narrower set actually considered. Those 
analyzing policy change over long periods of time or across political units 
(national states, or subnational units such as American states) are often 
especially aware of policies adopted in one place or time but not adopted or 
even proposed in others (e.g. Bosso 1987, Burstein, 1985, Gusfield 1981). 
Such awareness can enrich analyses by sensitizing researchers to the need to 
explain, in policy terms, what does not happen as well as what does; but 
theoretical work enabling researchers to consider systematically a wide range 
of possible policy alternatives is just beginning (Schneider & Ingram 1988). 

These criteria and the work just cited suggest that content analyses of policy 
should include at least the following dimensions: (a) whose activities are the 
object of the proposals (e.g. business organizations, government agencies, or 
specified categories of individuals); (b) on whose behalf the proposal is made, 
where explicit (e.g. specified minorities in antidiscrimination proposals, new 
parents in parental leave proposals, the elderly in social security proposals); 
(c) what activities are the object of the proposals (e.g. particular aspects of 
labor relations, education, manufacturing); (d) what the proposed legal stan- 
dards are, that is, what is required or prohibited (e.g. a specified minimum 
wage, limit on pollution, standard for deciding whether there is a restraint of 
trade); (e) who is required to bear the cost of the proposal, insofar as that is 
stated explicitly (e.g. a requirement that employees be paid their wages by 
their employers during periods of family leave, rather than requiring em- 
ployees to bear the burden or having the government subsidize the leave); (f) 
who bears the burden of enforcement, and what institutional mechanisms are 
proposed (including giving the government sole responsibility for enforce- 
ment, giving aggrieved individuals responsibility, and shared responsibility; 
judicial versus administrative mechanisms); and (g) the penalties to be im- 
posed for disobeying the proposed law (civil versus criminal; retrospective 
versus prospective approaches). 
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Proposal Heterogeneity and Survival 

Although social scientists have had little systematic to say about policy 
content, they have generated some propositions about proposal heterogeneity, 
survival, and change. 

Prior work suggests that the heterogeneity of proposals within a domain 
will be affected in somewhat contradictory ways by competing organizational 
forces. If only a few like-minded actors are interested in a policy area, 
proposals will be few and differences among them minor (cf Jacob 1988). To 
the extent those involved have different histories, interests, or points of view, 
however, more proposals are likely to be “floating around” in the “primeval 
soup” of policy (to use Kingdon’s terms 1984:ch. 6). 

When there is heterogeneity among proposals, two major forces work to 
reduce it. The first is the widespread tendency toward institutional isomorph- 
ism (Dimaggio & Powell 1983, Boli & Meyer 1987, Meyer et al 1987): 
Groups copy others’ approaches to problems as a way to cope with un- 
certainty or to emulate the powerful and prestigious. The desire to avoid 
uncertainty is particularly powerful in legal institutions; those who write laws 
try to use language that has been used before and been given precise meaning 
through previous interpretations. ; 

The second force reducing heterogeneity is knowledge that if policy change 
is to occur, majority support for a particular proposal is needed. For many 
legislators, this is a strong inducement to compromise on policy proposals or 
to engage in logrolling, in order to build a majority. And once a coalition 
favoring a particular proposal has been constructed, those involved are likely 
to avoid developing new proposals, for fear that doing so will cause the 
coalition to collapse and delay policy change (Sabatier 1988, Burstein 1985, 
Riker & Brams 1973). 

The push to reduce the number of proposals has three further implications. 
First, most new proposals will resemble old ones, either recombining familiar 
elements (Kingdon 1984) or extending ideas along seemingly commonsense 
dimensions (from younger to older, children to parents, big firms to small 
ones; see Schneider & Ingram 1988). Second, because familiarity is a virtue 
and coalition building a necessity, proposals long on the agenda are more 
likely than newer proposals or proposals considered intermittently to gain 
adherents and to get onto the short list of those seriously considered for 
adoption (cf Ratner 1980). And third, because time matters, persistence on 
the part of organizations supporting particular proposals will make a differ- 
ence; and persistence, in turn, will depend in part on resources. 

Prior work suggests a number of hypotheses that are ripe for testing in a 
sample of policy domains. 

l. The more organizationally fragmented a policy arena, as gauged by 
numbers of congressional committees, executive agencies, and lobbying 
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organizations involved, the greater the heterogeneity of proposals (cf Kingdon 
1984, Laumann & Knoke 1987). 

2. Movement up the policy agenda (from bill to hearing to floor action) 
will attract attention, increasing the number of organizations involved, and 
bence the heterogeneity of proposals (but see No. 4 below and Knoke 1990). 

3. To the extent some policy proposals are modeled on others, they are 
likely to have been made in similar, and hence relatively well understood, 
political and social environments. Scholars disagree, however, about the 
extent to which similarity will be defined by national boundaries (with 
borrowing in the United States taking place across state lines) or policy 
domains (with borrowing from the same domain in other countries; see 
Freeman 1985, Rose 1985, Walker 1977). 

4. Controlling for fragmentation and level on the agenda, the longer 
proposals on a particular subject have been under consideration, the more 
homogeneous they will become. 

5. The longer particular proposals have been continuously under con- 
sideration, the more likely they are to be adopted. 

6. For any particular issue, the greater the proportion of organizations 
formally favoring a particular proposal, the more likely that proposal is to 
move up the agenda and be adopted. 

7. Changes in the content of proposals over time will include recombina- 
tions of elements in previous proposals and small logical extensions of 
previous proposals, rather than major innovations or reformulations. 

These hypotheses imply some things not likely to occur among alternatives 
presented to legislatures: Proposals dramatically different from those already 
considered are not likely to be introduced; the content of newer proposals on 
an issue will be little affected by changing social, political, or economic 
circumstances; and legislation is not likely to result from the dramatic in- 
troduction of a new idea (Thomas et al 1987, Burstein 1985). 


POLITICS: SELECTING AMONG PROPOSALS AND 
ENACTING LEGISLATION 


The "political stream" affecting enactment, Kingdon argues (1984:152), 
flows independently of other streams affecting agenda setting and proposal 
formulation. Even when the same groups play a role in all three streams, they 
are likely to focus their efforts on different streams, and their impact may vary 
greatly from one stream to another (Burstein 1985). In addition, a critical role 
in the policy process is played by those who bring the three streams together 
in what Kingdon (1984:174) has called a policy window. 

The political stream has been the object of most research on policy change 
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and is the focus of the most familiar controversies about democratic politics— 
for example, "pluralist" versus class and elite models. In recent work on 
policy domains, four issues are of special concern: the openness of the 
political process, the impact of policy on politics, the role of political parties, 
and the relationship between public opinion and political outcomes. 


Openness in the Political Process 


Two images of the political process compete for dominance in studies of 
policy domains: Schattschneider's "iron triangles" and Heclo's "issue net- 
works" (see Heclo 1978, Hansen 1987). Iron triangles are a metaphor for 
policymaking controlled by three groups in a domain—an executive depart- 
ment, a congressional committee, and a dominant interest group. The three 
are informally but tightly bound to each other and minimize the influence of 
outsiders. Issue networks are more open and less organized; participants move 
in and out on the basis of expert knowledge ànd changing levels of interest, 
and there is more disagreement among participants than there is in iron 
triangles. In Heclo's conception (1978:105), “clouds of issue networks . . . 
overlay the once stable political reference points [iron triangles] with new 
forces that complicate calculations, decrease predictability, and impose con- 
siderable strains on those charged with government leadership." 

The contrast between the two images reflects a longstanding, fundamental 
issue in the study of politics: how open the political process is to outside 
influence (see, e.g., Polsby 1980, Alford & Friedland 1985). Heclo and 
others (e.g. Gais et al 1984, Hansen 1987, Bosso 1987, Wilson 1980, 
Schlozman & Tierney 1986, Salisbury et al 1987, Laumann & Knoke 1987) 
have moved toward two major conclusions during the last few years. First, the 
impression that iron triangles were pervasive was based in part on bad 
sampling. Schattschneider and others studied policy domains that were tightly 
organized and elosed to outside influence; they then generalized their con- 
clusions to American politics as a whole, without wondering whether the 
domains they studied might not be more closed off than most. Second, 
however common iron triangles may have been at one time, they are less so 
now because the political process has been opened up in many ways. 

At this point, all conclusions about the openness of the political process 
have to be viewed very cautiously. The imagery of "iron triangles" and 
“clouds of issue networks" is striking, but it is based largely on qualitative 
case studies. These are often rbetorically persuasive, but lack the rigorous 
conceptualization and measurement needed to precisely describe organiza- 
tional interactions or gauge influence in policy domains, to trace change over 
time, or to compare domains to each other. Significant progress has been 
made recently in refining the concepts and quantitative techniques needed to 
test the competing views, but the most sophisticated researchers tend to be 
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very modest about what they have accomplished so far (e.g. Gais 1984 and, 
especially, Salisbury et al 1987, Knoke & Pappi 1989). 


Policy and Politics 


A second major concern of those studying Egislative politics is what they call 
the impact of policy on politics, meaning Row the nature of particular issues 
affects politics within the relevant domzin (Freeman 1985, Lowi 1979, 
McFarland 1987). Wilson has argued (1982), for example, that the distribu- 
tion of costs and benefits implied by policies will affect domain organization. 
When both the costs and benefits of a policy are widely distributed, small 
interest groups will have little incentive to crganize to exert influence, and the 
result is likely to be what he calls “majoritarian politics." When costs and 
benefits are both concentrated, a policy often benefits one small group at the 
expense of another; both will have incentives to organize, the mass public will 
be indifferent, and the result will be “interest group politics.” “Client politics" 
results when benefits are concentrated but costs dispersed; potential benefici- 
aries have reason to organize, but those paying the costs will not. Agricultural 
subsidies paying a lot to a small number of farmers at the cost of a small 
increase in taxes on millions of consumers would be an example. Finally, 
when policies confer benefits widely at a cost borne mostly by relatively small 
numbers of people, the result will be "entreoreneurial politics." That this kind 
of policy should occur seems unlikely, given that opponents have strong 
incentives to organize while potential beneficiaries do not, because the cost of 
organizing will exceed any likely benefit. But Wilson claims that this is 
happening increasingly often, as political entrepreneurs successfully claim to 
represent vicariously a large group (legisletion limiting air pollution to curb 
acid rain would be an example). 

Many of those studying policy domains believe that the nature of issues 
affects the organization of the political prozess, and Wilson's is not the only 
typology trying to identify patterns of poHtical action common to domains 
sharing significant characteristics. Finding such patterns would help over- 
come the limitations of one-domain case studies. If a domain could be seen as 
representing a large number of. domains of the same type, findings from a few 
domains could be generalized to others. 

Typologies of policy domains all suffer fom two major problems, however 
(Greenberg et al 1977, Steinberger 1980). It is often difficult to identify a 
particular policy unambiguously as a partic.ilar type, and it often appears that 
the characteristics of a policy, rather thar being inherent in its nature, are 
socially constructed and subject to manipulation (see, e.g., McDonagh 1989, 
Paine 1989, Weiss 1989). Wilson and cthers make a plausible case, by 
providing examples, that political conflic: is affected by characteristics of 
policies; but little is known about how the policies come to be seen as having 
particular characteristics. 
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Political Parties 


Political parties are among the most important organizational innovations of 
modern societies, making mass politics possible as they link the public to the 
government by aggregating popular preferences, building coalitions, formu- 
lating policies, and stimulating support for favored programs. Their place in 
analyses of policy domains, however, is far from clear. Parties of the left may 
have a major role in the adoption of social policies, but their impact on policy 
adoption is not always strong, and sometimes there is no relationship at all 
(Burstein 1981, Skocpol & Amenta 1986, Korpi 1989, Pampel & Williamson 
1988). And parties seem to play a minimal role in the enactment of legislation 
in domains less closely identified with their core interests. 

There seem to be two major reasons why conclusions about the role of 
parties in policy adoption are inconsistent. First, many of those gauging the 
impact of parties, particularly sociologists, have not absorbed critical theoreti- 
cal work about the relationship between party stances and public opinion on 
particular issues. Parties hoping to win office often tailor their platforms to 
what the public wants in order to win votes; therefore, knowing how many 
people voted for a party means little unless the analysts also know what they 
want and how the party has tried to appeal to them (Tufte 1978, Page 1978). 
The lack of consistent findings stems from the failure to investigate carefully 
what stances the parties feel obliged to adopt. 

Second, party impact may differ from domain to domain or country to 
country because of how parties and interest groups are organized. Hansen 
(1987) argues that American political parties historically served as in- 
termediaries between the public and legislators, providing legislators with 
crucial information about what the public wanted and communicating back to 
the public the legislators’ claims about how well they had served constituents’ 
interests. In recent decades, however, congressional specialization, reduc- 
tions in the cost of organizing interest groups, the ongoing nature of rela- 
tionships between congressional committees and particular interest groups, 
and other factors have sometimes made interest groups more effective in- 
termediaries than parties are, from the legislators’ point of view. In addition, 
Jankowski has claimed (1988), broad-based corporatist interest groups may 
aggregate preferences more effectively than either parties or narrower interest 
groups, leading them to favor policies producing more optimal economic 
outcomes (cf Wilensky & Turner 1987). 


Public Opinion 

No analysis of policy change should ignore the relationship between public 
opinion and public policy, because the idea that governments should respond 
to their citizens is so central to both political scientists’ normative theory of 
democracy and the public’s conception of it. Yet most sociologists’ studies of 
policy domains do ignore public opinion, sometimes because appropriate data 
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are unavailable, but more often because they do not believe that public 
opinion can affect policy—despite the existence of a persuasive theoretical 
literature and a growing body of empirical work showing that there should be, 
and indeed is, a substantial relationship between the two (e.g. Mayhew 1974, 
Fiorina 1974, Page & Shapiro 1983, Erikson et al 1989, Burstein 1985). 

Debate about public opinion and public policy has long focussed on 
whether a significant relationship exists between the two; the works just cited 
are part of that debate. Recently, however, the debate has shifted, propelled 
by increasing understanding of two phenomena: the gap between the vague- 
ness of public opinion and the specificity of public policy, and the instability 
and malleability of mass and elite political preferences (see, e.g., Burstein 
1985, Kingdon 1984:ch. 3, 1988, Carmines & Stimson 1989, March & Olsen 
1989:ch. 9). As Carmines & Stimson have written about the politics of race, 
“Although it would be foolish to assert that congressmen [sic] are un- 
responsive to constituents' desires on this issue—of all policy domains it is 
usually the single best case of policy representation—the primary role for all 
institutional actors . . . is to lead public opinion. Policy making institutions, 
unlike the public, are confronted with specific questions that demand resolu- 
tion" (1989:60). 

Often the notion that elites lead public opinion is taken to mean simply that 
elites have fooled the public into thinking that it supports policies which elites 
were going to adopt anyway. But some recent work (March & Olsen 1989:ch. 
9, Kingdon 1988, Carmines & Stimson 1989) suggests that reality is more 
complex. Often elites and masses are part of the same complex process in 
which issues are raised, policy options formulated, and opinion created in the 
course of policy debates. While some may try to influence this process, no 
one really controls it. 

As a result, an increasingly common view is that public opinion constrains 
rather than directs public policy. The political process involves not so much 
the legislature trying to follow public opinion in detail, but rather its trying to 
satisfy the public generally and to avoid opposition to what the public clearly 
wants. This hypothesis is quite persuasive but has not been rigorously tested. 
Agreed-upon ways of assessing how opinion constrains policy have not been 
developed, and most studies of policy domains do not even deal directly with 
the issue. 


POLITICAL DOMAINS AND THE LARGER SOCIETY 


Sociologists’ focus on policy domains has substantially increased their un- 
derstanding of how organizations and culture affect public policy. It has not 
been easy, however, to show how policy changes within domains are linked 
to broad patterns of social change. Some researchers (e.g. Korpi 1989) 
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analyze policy change within a particular domain without considering the 
impact of domain-specific organizations—an approach which seems mis- 
guided in light of what has been learned about the importance of forces 
operating within each domain. Others (e.g. Amenta & Carruthers 1988) 
consider the impact of both broad social forces (such as industrialization) and 
domain-specific factors (such as social movement activity) on policy out- 
comes, but these are viewed as alternative explanations when it might seem 
more reasonable to view the domain-specific factors as partly the result of the 
broader forces. Only a few scholars (e.g. Pampel & Williamson 1988) have 
begun to assess how broad social forces interact with domain-specific factors 
to determine political outcomes, and even they fail to take into account the 
organizational complexity of contemporary societies (see also Knoke & Bur- 
leigh 1989, Scott & Meyer 1983, March & Olsen 1989, Scott 1983a, cf Lipset 
& Rokkan 1967). 

There is thus a need for research designed to show how change within 
policy domains is affected by forces central to earlier sociological studies of 
politics, including major sociodemographic groups, political parties and other 
encompassing organizations, and individual citizens. Two recent types of 
work address these concerns. 

The first is Carmines & Stimson's Issue Evolution (1989). They show how 
the politics of race has been transformed since the New Deal, considering the 
linkages among members of Congress, political activists, the political parties, 
coalitions of sociodemographic groups, and ideological changes among both 
elites and masses. Adopting a perspective with parallels to Kingdon's, they 
describe how organizations connect the public and the government, and they 
suggest ways in which social scientists should begin to reformulate their ideas 
about the nature of representation in democratic societies. 

The second relevant body of work is that on corporatism and its relationship 
to democratic politics (e.g. Jankowski 1988, Wilensky & Turner 1987, cf 
Hansen 1987). The grouping of many specific interests into corporatist 
bargaining structures may help overcome the policy fragmentation and paraly- 
sis sometimes said to characterize the United States, and corporatist organiza- 
tions may at times link major social groups to government more effectively 
than political parties do. It would be very useful at this point to analyze the 
relationship between domain-specific interest organizations and those 
transcending particular domains, and how both influence political representa- 
tion and policy change. 


CONCLUSIONS 


Sociologists’ interest in public policy has grown substantially in recent years; 
increasingly they have decided that the most effective way to study policy is 
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to focus on change within policy domains. Such domains are to a considerable 
extent social constructions, created through interactions among organizations 
and given meaning by culture. Policy change takes place through the conjunc- 
tion of three streams of activity moving fairly independently: problem 
recognition and agenda setting; the specification of policy options; and the 
politics of selecting among proposals and enacting legislation. 

Substantial progress has been made in recent years in the analysis of all 
three streams. By far the greatest amount of work has been done on the 
political stream, focusing on the period just before legislative enactment; but 
work on the other streams is increasing rapidly. 

It is possible to reach some general conclusions about the determinants of 
policy change. Public policy is influenced primarily by formal organizations 
and the relations among them, both informal and as structured by formal rules 
governing interorganizational relationships. The relationship is problematic 
between organizations and the sociodemographic groups usually focused on 
analyses of politics—including classes, farmers, and racial, religious, and 
ethnic groups, so the relationship of such groups to policy is problematic as 
well. 

Public policy is also critically affected —indeed, created and given mean- 
ing—by culture. Political issues, policy proposals, legislation, administrative 
regulations, judicial decisions, and their interpretation are all socially con- 
structed. Culture shapes both how the goals of public policy are defined and 
the "strategies of action" (Swidler 1986:277) employed to reach them (March 
& Olsen 1989, Sabatier 1988, Scott 1983b, Thomas et al 1987, Gusfield 
1981, Wildavsky & Tenenbaum 1981). 

Government organizations are active in formulating policy and deciding 
how it will be implemented as well; they are neither neutral arbitrators in 
conflicts among social groups nor the agents of dominant groups (Almond 
1988, Korpi 1989, Nordlinger 1981, Skocpol & Amenta 1986, Wilson 1980). 

Support for specific hypotheses about policy change is more problematic. 
Most studies focus on one or at most a handful of domains, and little thought 
has been given to how representative they are of all policy domains (see Heclo 
1978, Hansen 1987, Schlozman & Tierney 1986:ch. 15). In addition, as in 
most areas of sociology and the other social sciences, weaknesses in con- 
ceptualization and measurement of crucial concepts encourage caution when 
evaluating prior work. It is easy to assume that sociologists studying policy 
domains know more than they actually do. 

The study of policy domains raises many questions about the nature of 
democratic government in modern societies, and about the organization of 
society as well. Both the garbage can model adopted by Kingdon and the 
evolutionary model propounded by Carmines & Stimson suggest that the state 
of the world is ambiguous, interpretations of évents are malleable, organiza- 
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tions are relatively unstructured, involvement in particular issues is highly 
fluid, and the complexities of modern life are such that any particular histori- 
cal development could easily have turned out differently. What social scien- 
tists should expect of their causal models, on the one hand, and representative 
government, on the other, should be questions very much at the center of 


future work on policy domains. . 


ACKNOWLEDGMENTS 


I would like to thank Ron Jepperson, Florence Katz, David Knoke, Peter 
May, W. Richard Scott, and an anonymous reviewer for their helpful advice 


and comments. 


Literature Cited 


Alford, R. R., Friedland, R. 1985. Powers of 
Theory. New York: Cambridge Univ. Press 

Almond, G. A. 1988. Return to the state. Am. 
Polit. Sci. Rev. 82:853-74 

Amenta, E., Carruthers, B. G. 1988. The 
formative years of U.S. social spending 
policies. Am. Sociol. Rev. 53:661—78 

Amenta, E., Clemens, E. S., Olsen, J., 
Parikh, A., Skocpol, T. 1987. The political 
origins of unemployment insurance in five 
American states. In Studies in American 
Political Development, ed. K. Orren, S. 
Skowronek, 2:137-182. New Haven: Yale 
Univ. Press 

Bendix, R., Lipset, S. M. 1966. The field of 
political sociology. In Political Sociology, 
Pm Coser, pp. 9-47. New York: Harper 

OW 


Berk, R. A., Brackman, L., Lesser, S. 1977. 
A Measure of Justice. New York: Academic 

Boli, John. 1987. Human rights or state ex- 
pansion? Cross-national definitions of con- 
stitutional rights, 1870-1970. See Thomas 
et al 1989, pp. 133-49 

Boli, J., Meyer, J. W. 1987. The ideology of 
childhood and the state. See Thomas et al 
1989, pp. 217-41 

Bosso, C. J. 1987. Pesticides and Politics. 
Pittsburgh: Univ. Pittsburgh Press 

Budge, I., Hofferbert, R. I. 1990. Mandates 
and policy outputs. Am. Polit. Sci. Rev. 
84:111-31 

Burk, J. 1985. The origins of federal securities 
regulation. Soc. Forces 63:1010—29 

Burstein, P. 1981. The sociology of democrat- 
ic politics and government. Annu. Rev. 
Sociol. 7:291-319 

Burstein, P. 1985. Discrimination, Jobs, and 
Politics. Chicago: Univ. Chicago Press 

Burstein, P. 1988. Political sociology. In The 
Future of Sociology, ed. E. Borgatta, K. 
Cook, pp. 166-78. Beverly Hills: Sage 


Burstein, P., Freudenburg, W. 1978. Chang- 
ing public policy. Am. J. Sociol. 84:99— 
122 


Caldeira, G. A., Wright, J. R. 1988. Orga- 
nized interests and agenda setting in the 
U.S. Supreme Court. Am. Polit. Sci. Rev. 
82:1109-27 

Carmines, E. G., Stimson, J. A. 1989. Issue 
Evolution. Princeton: Princeton Univ. Press 

Clausen, A. G. 1973. How Congressmen De- 
cide. New York: St. Martin’s Press 

Cobb, R. W., Elder, C. D. 1983. Participa- 
tion in American Politics. Baltimore: Johns 
Hopkins Univ. Press. 2nd ed. 

Cohen, M. D., March, J. G., Olsen, J. P. 
1972. A garbage can model of organization- 
al choice. Admin. Sci. Q. 17:125 

Costain, A. N. 1988. Activists, agitators, and 
issues. Pres. Ann. Meet. Am. Polit. Sci. 
Assoc., Washington, DC 

DiMaggio, P. J., Powell, W. W. 1983. The 
iron cage revisited: institutional isomorph- 
ism and collective rationality in organiza- 
tional fields. Am. Sociol. Rev.-48:147-60 

Duffy, G. 1989. The validity problem of con- 
tent analysis. Presented at Ann. Meet. Am. 
Polit. Sci. Assoc., Atlanta 

Edelman, M. 1988. Constructing the Political 
Spectacle. Chicago: Univ. Chicago Press 

Erikson, R. S., Wright, G. C. Jr., Mclver, J. 
P. 1989. Politica! parties, public opinion, 
and state policy in the United States. Am. 
Polit. Sci. Rev. 83:729-50 

Fiorina, M. 1974. Representatives, Roll Calls, 
and Constituencies. Lexington, Mass: Lex- 


ington 

Freeman, G. P. 1985. National styles and poli- 
cy sectors: explaining structured variation. 
J. Public Policy 5:467-96 

Gais, T. L., Peterson, M. A., Walker, J. L. 
1984. Interest groups, iron triangles, and 
representative institutions in American 


348 BURSTEIN 


national government. Br. J. Polit. Sci. 14: 
161-85 

Gamson, W. A. 1975. The Strategy of Social 
Protest. Homewood, Ill: Dorsey 

Gamson, W. A., Modigliani, A. 1987. The 
changing culture of affirmative action. Res. 
Polit. Sociol. 3:137-77 

Gamson, W. A., Modigliani, A. 1989. Media 
discourse and public opinion on nuclear 
power. Am. J. Sociol. 95:1-37 

Garnier, M., Hage, J., Fuller, B. 1989. The 
strong state, social class, and controlled 
school expansion in France, 1881-1975. 
Am. J. Sociol. 95:279-306 

Garrow, D. 1978. Protest at Selma. New 
Haven: Yale Univ. Press 

Glendon, M. A. 1989. The Transformation 
of Family Law. Chicago: Univ. Chicago 
Press 

Greenberg, G. D., Miller, J. A., Mohr, L. B., 
Vladeck, B. C. 1977. Developing public 
policy theory. Am. Polit. Sci. Rev. 71: 
1532-43 

Gusfield, J. R. 1981. The Culture of Public 
Problems. Chicago: Univ. Chicago Press 

Hansen, J. M. 1987. Choosing sides: the crea- 
tion of an agricultural policy network in 
Congress, 1919-1932. In Studies in Amer- 
ican Political Development, ed. K. Orren, 
S. Skowronek, 2:183-229. New Haven: 
Yale Univ. Press 

Hanson, R. L. 1983. The ‘content’ of welfare 
policy. J. Polit. 45:771~85 

Harrison, C. 1988. On Account of Sex: The 
Politics of Women's Issues, 1945-1968. 
Berkeley: Univ. California Press 

Heclo, H. 1978. Issue networks and the execu- 
tive establishment. In The New American 
Political System, ed. A. King, pp. 87-124. 
Washington, DC: Am. Enterprise Inst. 

Heidenheimer, A. J., Heclo, H., Adams, C. 
T. 1983. Comparative Public Policy. New 
York: St. Martin's Press 2nd ed. 

Hilgartner, S., Bosk, C. L. 1988. The rise and 
fall of social problems: a public arenas mod- 
el. Am. J. Sociol. 94:53-78 

Hofstetter, R. C. 1981. Content analysis. In 
Handbook of Political Communication, ed. 
D. D. Nimmo, K. R. Sanders. pp. 529-60. 
Beverly Hills: Sage 

Hooks, G. 1990. From an autonomous to a 
captured state agency: the decline of the 
New Deal in agriculture. Am. Sociol. Rev. 
55:29-43 

Jacob, H. 1988. Silent Revolution: The Trans- 
formation of Divorce Law in the United 
States. Chicago: Univ. Chicago Press 

Jankowski, R. 1988. Preference aggregation 
in political parties and interest groups: a 
synthesis of corporatist and encompassing 
organization theory. Am. J. Polit. Sci. 
32:105-25 


Janowitz, M. 1970. Political Conflict. Chica- 
go: Quadrangle 

Jenkins, J. C., Brents, B. G. 1989. Social 
protest, hegemonic competition, and social 
reform. Am. Sociol. Rev. 54:891-909 

Jenkins, J. C., Perrow, C. 1977. Insurgency 
of the powerless. Am. Sociol. Rev. 42:249— 
61 


Kelman, S. 1981. Regulating America, Regu- 
lating Sweden. Cambridge, Mass.: MIT 
Press 

Kelman, S. 1987. Making Public Policy. New 
York: Basic 

Kingdon, J. W. 1984. Agendas, Alternatives, 
and Public Policies. Boston: Little, Brown 

Kingdon, J. W. 1988. Ideas, politics, and 
public policies. Presented at Ann. Meet. 
Am. Polit. Sci. Assoc., Washington, DC 

Kingdon, J. W. 1990. How do issues get 
on public policy agendas? Pres. Ann. 
Meet. Am. Sociol. Assoc., Washington, 
DC 

Knoke, D. 1990. Networks as political glue. 
Pres. Ann. Meet. Am. Sociol. Assoc., 
Washington, DC 

Knoke, D., Burleigh, F. 1989. Collective ac- 
tion in national policy domains. Res. Polit. 
Soc. 4:187—208 

Knoke, D., Laumann, E. O. 1982. The social 
organization of national policy domains. In 
Social Structure and Network Analysis, ed. 
P. V. Marsden, N. Lin, pp. 255-70. Bever- 
ly Hills, Calif: Sage 

Knoke, D., Pappi, F. U. 1989. Fighting col- 
lectively: collective actors, action sets, and 
opposition networks in the U.S. and Ger- 
man labor policy domains. Pres. Ann. 
Meet. Am. Sociol. Assoc., San Francisco 

Korpi, W. 1989. Power, politics, and state 
autonomy in the development of social 
citizenship. Am. Sociol. Rev. 54:309-28 

Laumann, E. O., Knoke, D. 1987. The Orga- 
nizational State. Madison: Univ. Wisc. 
Press 

Lipset, S. M., Rokkan, S. 1967. Cleavage 
structures, party systems, and voter align- 
ments. In Party Systems and Voter Align- 
ments, ed. S. M. Lipset, S. Rokkan, pp. 
1-64. New York: Free 

Lowi, T. J. 1979. The End of Liberalism. New 
York: Norton. 2nd ed. 

McAdam, D. 1982. Political Process and the 
Development of Black Insurgency, 1930— 
1970. Chicago: Univ. Chicago Press 

McCarthy, J., Zald, M. 1977. Resource 
mobilization and social movements. Am. J. 
Sociol. 82:1212-41 

McDonagh, E. L. 1989. Issues and con- 
stituencies in the progressive era: House roll 
call voting on the nineteenth amendment, 
1913-1919. J. Polit. 51:119-36 

McFarland, A. S. 1987. Interest groups and 


theories of power in America. Br. J. Polit. 
Sci. 17:129-47 

March, J. G., Olsen, J. P. 1989. Rediscover- 
ing Institutions. New York: Free 

Mayhew, D. 1974. Congress: The Electoral 
Connection. New Haven: Yale Univ. Press 

Meyer, J. W., Boli, J., Thomas, G. M. 1987. 
Ontology and rationalization in the western 
cultural account. See Thomas et al 1987, 
pp. 12-37 

Meyer, J. W., Scott, W. R., with Rowan, B., 
Deal, T. E. 1983. Organizational Environ- 
ments. Beverly Hills: Sage 

Nelson, B. J. 1984. Making an Issue of Child 
Abuse. Chicago: Univ. Chicago Press 

Nordlinger, E. A. 1981. On the Autonomy of 
the Democratic State. Cambridge: Harvard 

. Univ. Press 

Ogburn, W. F. 1912. Progress and Uniformity 

"^in Child Labor Legislation. New York: 
Columbia Univ. Press 

Orloff, A. S., Skocpol, T. 1984. Why not 
equal protection? Am. Sociol. Rev. 49:726- 
50 


Page, B. 1978. Choices and Echoes in Presi- 
dential Elections. Chicago: Univ. Chicago 
Press 

Page, B., Shapiro, R. Y. 1983. Effects of 
public opinion on policy. Am. Polit. Sci. 
Rev. 77:175-90 

Paine, S. C. 1989. Persuasion, manipulation, 
and dimension. J. Polit. 51:36-49 

Pampel, F. C., Williamson, J. B. 1985. Age 
structure, politics, and cross-national pat- 
terns of public pension expenditure. Am. 
Sociol. Rev. 50:782-99 

Pampel, F. C., Williamson, J. B. 1988. Wel- 
fare spending in advanced industrial de- 
mocracies, 1950-1980. Am. J. Sociol. 93: 
1424-56 

Pavalko, E. K. 1989. State timing of policy 
adoption. Am. J. Sociol. 95:592-615 

Polsby, N. W. 1980. Community Power and 
Political Theory. New Haven: Yale Univ. 
Press. 2nd ed. 

Polsby, N. W. 1984. Political Innovation in 
America. New Haven: Yale Univ. Press 
Pressman, J. L., Wildavsky, A. 1984. Im- 
plementation. Berkeley: Univ. California 

Press, 3rd ed. 

Quadagno, J. 1989. The Transformation of 
Old Age Security, Chicago: Univ. Chicago 
Press 

Ramirez, F. O., Boli, J. 1987. Global patterns 
of educational institutionalization. See Tho- 
mas et al 1987, pp. 150-72 

Ratner, R. S., ed. 1980. Equal Employment 
Policy for Women. Philadelphia: Temple 
Univ. Press 

Ratner, R. S., Burstein, P. 1980. Ideology, 
specificity, and the coding of legal docu- 
ments. Am. Sociol. Rev. 45:522-25 


POLICY DOMAINS 349 


Riker, W. H. 1986. The Art of Political 
Manipulation. New Haven: Yale Univ. 
Press 

Riker, W. H., Brams, S. J. 1973. The paradox 
of vote trading. Am. Polit. Sci. Rev. 68: 
1235-47 

Rose, R. 1985. The programme approach to 
the growth of government. Br. J. Polit. Sci. 
15:1-28 

Sabatier, P. A. 1988. An advocacy coalition 

" framework of policy change and the role of 
policy-oriented learning therein. Policy Sci. 
21:129-68 

Salisbury, R. H., Heinz, J. P., Laumann, E. 
O., Nelson, R. L. 1987. Who works with 
whom? Interest group alliances and opposi- 
tion. Am. Polit. Sci. Rev. 81:1217-34 

Schattschneider, E. E. 1960. The Semi- 
Sovereign People. New York: Holt 

Schlozman, K. L., Tierney, J. T. 1986. Orga- 
nized Interests and American Democracy. 
New York: Harper & Row 

Sclineider, A., Ingram, H. 1988. Systemati- 
cally pinching ideas: a comparative ap- 
proach to policy design. J. Public Policy 


Scott, W. R. 1983a. Reform movements and 
organizations: the case of aging. See Meyer 

- & Scott 1983, pp. 115-27 

Scott, W. R. 1983b. The organization of en- 
vironments. See Meyer & Scott 1983, pp. 
155-75 

Scott, W. R., Meyer, J. W. 1983. The organ- 
ization of societal sectors. See Meyer & 
Scott, pp. 129-53 

Simon, H. A. 1985. Human nature in politics. 
Am. Polit. Sci. Rev. 79:293-304 

Skocpol, T., Amenta, E. 1986. States and 
ot policies. Annu. Rev. Sociol. 12:131— 
7 


Snow, D. A., Rochford, E. B. Jr., Worden, S. 
K., Benford, R. 1986. Frame alignment 
processes, micromobilization, and move- 
ment participation. Am. Sociol. Rev. 51: 
464-81 

Steinberg, R. 1982. Wages and Hours. New 
Brunswick, NJ: Rutgers Univ. Press 

Steinberger, P. J. 1980. Typologies of public 
policy. Social Sci. Q. 61:185—97 

Stone, D. A. 1989. Causal stories and the 
formation of policy agendas. Polit. Sci. Q. 
104:281—300 

Stryker, R. 1990. Science, class, and the 
welfare state. Am. J. Sociol. 96:684— 
726 

Swidler, A. 1986. Culture in action. Am. 
Sociol. Rev. 51:273-86 

Thomas, G. M., Meyer, J. W., Ramirez, F. 
O., Boli, J., eds. 1987. Institutional Struc- 
ture. Newbury Park, Calif: Sage 

Tilly, C. 1975. Reflections on the history of 
European state-making. In The Formation 


350 BURSTEIN 


of National States in Western Europe, ed. 
C. Tilly, pp. 3-83. Princeton: Princeton 
Univ. Press 

Tufte, E. 1978. Political Control of the Econ- 
omy. Princeton: Princeton Univ. Press 

Walker, J. L. 1977. Setting the agenda in 
the U.S. Senate. Br. J. Polit. Sci. 7:423- 
46 

Walker, J. L. 1983. The origins and mainte- 
nance of interest groups in America. Am. 
Polit. Sci. Rev. 77:390-406 


Weiss, J. A. 1989. The powers of problem 
definition. Policy Sci. 22:97-121 

Wildavsky, A., Tenenbaum, E. 1981. The 
Politics of Mistrust. Beverly Hills: Sage 

Wilensky, H. L., Turner, L. 1987. Democrat- 
ic Corporatism and Policy Linkages. Berke- 
ley: Univ. California, Berkeley, Inst. Int. 
Stud. f 

Wilson, J. Q. 1980. The politics of regulation. 
In The Politics of Regulation, ed. J. Q. 
Wilson, pp. 357-94. New York: Basic 


Annu. Rev. Sociol. 1991. 17:351—83 
Copyright © 1991 by Annual Reviews Inc. All rights reserved 


ETHNIC MINORITIES AND 
MENTAL HEALTH 


William A. Vega 


School of Public Health, University of California, Berkeley, Berkeley, California 
94720 


Rubén G. Rumbaut 
Department of Sociology, San Diego State University, San Diego, California 92182 


KEY WORDS: minority groups, race-ethnicity, immigrants and refugees, mental health, psy- 
chiatric epidemiology, stress and adaptation 


Abstract 


The sociological study of the mental health of racial-ethnic minorities ad- 
dresses issues of core theoretical and empirical concern to the discipline. This 
review summarizes current knowledge about minority mental health and 
identifies conceptual and methodological problems that continue to confront 
research in this field. First, a critique is presented of epidemiological 
approaches to the definition and measurement of mental health in general, and 
minority mental health in particular, including an overview of the most 
frequently used symptom scales and diagnostic protocols. Next, the most 
important research studies conducted over the past two decades are summa- 
rized and discussed, and comparisons of prevalence rates and correlates of 
depressive symptomatology among Black, Hispanic, Asian, and American 
Indian ethnic groups are provided. Following the overview of descriptive 
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epidemiological findings, some key analytic issues surrounding the study of 
stress, adaptation and minority mental health are considered. Finally, we 
propose various recommendations for future research. 


INTRODUCTION 


Far from being a subject of idiosyncratic interest on the margins of sociologi- 
cal inquiry, the study of the mental health of racial-ethnic minorities addresses 
issues of core theoretical and empirical concern to the discipline. Classical 
social theory, notably in its formulation of the concepts of “alienation” and 
"anomie," sought to understand and explain the relationship between the 
individual and society, and the ways in which microlevel subjective experi- 
ence is embedded in macrolevel objective social structures. The origins of 
American sociology in the early twentieth century reflected a similar focus on 
the personal and behavioral consequences of conditions of social disorganiza- 
tion and marginality. Indeed, for the United States in particular, the develop- 
ment of systems of caste and class stratified by racial and ethnic statuses has 
been a central theme of its history, shaped over many generations by the 
European conquest of indigenous peoples and by massive waves of both 
coerced and noncoerced immigration from all over the world. After World 
War I, as a consequence of restrictive national-origins laws, the Great Depres- 
sion and World War II, immigration to the United States declined to its lowest 
point since the early nineteenth century, and by the 1950s "ethnicity" itself 
was seen as a fading phenomenom in the national scene. But since the 1960s a 
resurgence of ethnic consciousness among native minorities in the wake of the 
civil rights movement, followed by mass new waves of immigrants and 
refugees from the developing nations of Asia and Latin America, has trans- 
formed anew the American ethnic mosaic (Portes & Rumbaut 1990). In this 
rapidly changing context, what are the emotional consequences of “ethclass” 
(Gordon 1964) inequality in America—the "hidden injuries" not only of class 
but of minority status and ethnocultural distinctiveness? This essay is an effort 
to review current knowledge about the mental health of racial-ethnic minority 
groups in the United States, to identify conceptual and methodological prob- 
lems that continue to confront research in this field and to suggest avenues for 
future research. 

As a starting point, in order to place our review in a comparative frame 
of reference, Table 1 presents 1980 census data on the size and selected so- 
cioeconomic characteristics of racial-ethnic groups in the United States. 
Given the oft-noted relationship between social class and mental disorder, 
the data in Table 1 may begin to suggest some possible hypotheses about 
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the mental health of minority groups, as well as to raise some questions 
about the appropriateness of racial and ethnic categories used in official 
statistics. ! 

Non-Hispanic whites, blacks, American Indians, and Puerto Ricans are 
overwhelmingly native-born populations, as are Pacific Islanders (mainly 
Hawaiians, Guamanians and Samoans). By contrast, the other “Hispanic” and 
“Asian” minorities in Table 1 consist overwhelmingly of recent immigrants, 
with the notable exceptions of Mexican-Americans and Japanese-Ameri- 
cans. The 1990 census will reveal major changes in the size and status of 
these groups, particularly those who have grown dramatically as a re- 
sult of accelerating immigration over the past decade (Portes & Rumbaut 
1990). 

During the 1980s a total of six million immigrants and refugees were 
legally admitted into the United States, nearly half from Asian countries and 
the bulk of the rest from Latin America. As a result, Asian-Americans have 
been the nation’s fastest growing minority population; already by 1990 the 
Filipinos had surpassed the Chinese to become the largest Asian-origin ethnic 
group in the United States, a phenomenom that has gone largely unnoticed. 
Although they come from Third World countries, approximately one third of 
these new immigrants are well-educated "brain drain" professionals, primari- 
ly the Asian Indians, Filipinos, Koreans, Chinese, and Taiwanese. In addi- 
tion, sizable if unknown numbers of others entered extralegally. Over three 
million formerly undocumented immigrants qualified by 1989 for legalization 
of their status under the amnesty provisions of the Immigration Reform and 
Control Act of 1986 (IRCA); two thirds of them were Mexican nationals. 
Unauthorized immigrants (such as most coming from Central America) who 


!For example, over 180 million persons were classified by the 1980 census as “non-Hispanic 
whites," a sponge term absorbing many diverse ethnic groups—but excluding 14.6 million 
“Hispanics,” who are classified as such on the basis not of race but of language (even though 
many of them do not speak Spanish). Among Hispanics are counted two million Puerto Ricans 
living on the mainland, but not the larger number living on the island (who are US citizens by 
birth and who do universally speak Spanish). More clearly classified in terms of social origin are 
persons of Mexican and Cuban descent; less clear is the composition of the population of over 
three million lumped together as "Other Spanish," who may include a score of immigrant 
nationalities as well as indigenous groups. “Non-Hispanic” minorities are classified as different 
"races," ranging widely in size from "blacks" (26 million) to "American Indians and Alaska 
Natives" (1.4 million); in between are "Asians and Pacific Islanders" (3.5 million), who number 
among them far more diverse ethnic groups than any other single census category. Beginning 
with the 1980 census, this latter “racial” (or geographic-origin) aggregate was subdivided into 
several Asian national-origin groups. All this makes for a conceptually muddled typology of 
"racial-ethnic" groups, even if it is an improvement over previous censuses. 
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Table 1 Size and socioeconomic characteristics of major racial-ethnic groups in the United States, 
1980. (Source: US Bureau of the Census, 7980 Census of Population: General Social and Economic 
Characteristics, PC80-1-C1, December 1983, Tables 160—171) 


Nativity Education’ Income 
Persons Foreign-born High school Below Per capita 
graduate poverty income 
Racial-ethnic groups (N) (46) (96) (96) ($) 
Non-Hispanic 
White 180,602,838 3.9 69.6 8.9 7942 
Black 26,091,857 2.8 51.2 29.8 4556 
Asian and Pacific 3,550,605 59.2 74.9 13.1 7095 
Islanders: 
Chinese 812,178 63.3 71.3 13.5 7476 
Filipino 781,894 64.7 74.2 7.1 6915 
Japanese 716,331 28.4 81.6 6.5 9068 
Asian Indian 387,223 70.4 80.1 9.9 8667 
Korean 357,393 81.9 78.1 11.7 5544 
Vietnamese 245,025 90.5 62.2 35.5 3382 
Pacific Islanders 259,566 11.8 67.3 19.3 5106 
American Indian, 1,432,807 1.9 55.8 27.6 4618 
Eskimo and Aleut 
Race, n.e.c. 264,015 34.2 37.9 21.1 4887 
Hispanic? 14,603,683 28.6 44.0 23.5 4586 
Mexican 8,678,632 26.0 37.6 23.3 4231 
Puerto Rican? 2,004,961 3.0 40.1 36.3 3905 
Cuban 806,223 77.9 55.3 13.2 6839 
Other Spanish 3,113,867 39.4 57.4 18.6 5430 


Total population: 226,545,805 6.2 66.5 12.4 7298 


‘Percent of high school graduates among persons aged 25 years or older. 
Persons of Spanish origin may be of any race. 
Puerto Ricans residing in the U.S. mainland only. 


entered the country after 1981 were ineligible for legalization of their status 
under IRCA, and it is uncertain to what extent they will be enumerated by 
the 1990 census, although their often traumatic contexts of exit and harsh 
contexts of reception may place them at higher risk for mental health 
problems. 

All of these groups show considerable diversity in their levels of education 
and income. Indeed, the 1980 census data suggest the formation of new 
"ethclasses," and these differences have probably widened during the past 
decade. Among the groups listed in Table 1, several exceeded the levels of 
educational attainment of non-Hispanic whites; without exception all of these 
were of Asian origin (Japanese, Indians, Koreans, Filipinos, and Chinese). 
To varying degrees all other racial-ethnic minorities fell substantially below 
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majority-group norms in education and income, and the data do not readily 
fit into catch-all categories such as “Hispanic” or “Asian.” For example, 
the highest poverty rates in 1980 were observed among Puerto Ricans, 
Vietnamese, Blacks, American Indians, Mexican-Americans, and Pacif- 
ic Islanders. Since 1980 an accumulating body of research has noted still 
higher levels of poverty among “second-wave” Cambodians, Laotians, and 
Vietnamese, Haitian and Central American escapees, Mariel Cubans, 
and Afghan and Ethiopian refugees. This diversification of patterns of so- 
cial stratification among minority groups, including recent Asian and Latin 
American immigrants, and of their different modes of incorporation into 
the American economy and society, sets the stage for a consideration of pat- 
terns of inequality in mental health outcomes as well (Portes & Rumbaut 
1990). 


Defining Mental Health and Mental Disease 


A core dilemma, historically, has been the challenge of defining mental 
health, as well as of knowing how to measure it satisfactorily. The contempo- 
rary mental health arena transcends the disciplinary boundaries of psychiatry, 
psychology, social work, anthropology, genetics, and other fields in addition 
to sociology. It is no wonder that controversy has been rampant. This issue is 
no mere academic matter but goes straight to the heart of etiological debates, 
professional training, and the organization and payment of services; it also 
heavily influences the direction of funded research (Vega & Murphy 1990). 
Therefore, this is ultimately a political issue and responsive to political forces. 
In the past, this situation has often found minority researchers, as well as 
others interested in minority mental health, observing the development of new 
knowledge and services from the sidelines, unable to influence the course of 
events. 

The term “mental health” was originally intended to reflect psychological 
well-being and resilience; in essence, a satisfactory if not optimal state of 
being. This term arose as a precursor to the community mental health move- 
ment of the mid-1950s and was a reaction to bleak images of refractory mental 
disease and institutionalized treatment which predominated until that time 
(Barter 1983, Biegal & Levenson 1972). Nonetheless, despite this optimistic 
facade, researchers and services providers, from that time until the present, 
have been narrowly focused on mental disorders. Psychiatrists have played 
the dominant role in setting the “parameters” of mental health as a subspecial- 
ity of medicine, and in defining the content of psychiatric disorders (Millon & 
Klerman 1986). The result is that the content of mental health research has 
been studied repeatedly from a nonnormative perspective, and for many years 
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the research literature was derived primarily from clinically based studies and 
rooted in a disease model (Fabrega 1990). More recently, a return to biologi- 
cal explanations of mental illness has further discouraged the search for social 
origins (Kleinman 1988). 

Minority mental health has been seriously understudied because there have 
been few minority researchers, and people of color often have not been 
represented in the clinical patient populations used to develop the epidemio- 
logic data base over decades of research. Thus, any bearing that culture may 
have on the manifestation, perception, recognition, and salience of psychiatr- 
ic symptoms within ethnic minority groups is almost completely overlooked 
in contemporary psychiatric epidemiology (Good & Good 1986). A review of 
contemporary journals, except for those explicitly tied to psychiatric an- 
thropology or minority research, leads quickly to the conclusion that culture 
serves only to camouflage "pure" psychiatric phenomena. The methodolog- 
ical challenge for contemporary researchers is how best to parcel out cultural 
influences to achieve more accurate measurement and understanding of psy- 
chiatric problems. 


Descriptive Epidemiology and Minority Mental Health Status 


The advent of large field surveys for estimating the "true prevalence" of 
psychiatric disorders in noninstitutional populations began in earnest in the 
1950s with data reported from the Midtown Manhattan Study (Srole et al 
1962) in New York City and the Stirling County Study (Leighton et al 1963) 
in Canada, using symptom checklists designed to measure the severity and/ 
or duration of symptoms. These short scales were designed to assess a pool 
of symptoms commonly observed among patient populations, and their cri- 
terion validity was demonstrated by comparing clinical to nonclinical pop- 
ulations. Most commonly, these checklists were created to tap depressed 
affect, psychological distress, and level of dysfunction—including social 
role performance. Link & Dohrenwend (1980) have noted that these short 
scales seem to cover a constellation of symptoms that typify psychological 
distress in its most common expression in the United States, and that 
these symptoms are commonly associated with low socioeconomic status, 
stress, and physical health problems. According to these authors, these 
types of symptoms fit the concept of “demoralization” originally postulated 
by Jerome Frank (1973), rather than a clinical diagnosis of “major depres- 
sion” (a more profound and enduring incapacitation). In using symptom 
checklists for estimating the community prevalence of mental health prob- 
lems, researchers have also made patient/nonpatient comparisons and estab- 
lished a cutoff point that represents "caseness." The "caseness" standard is 
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intended to identify individuals who have symptom levels similar to those 
of patients in treatment.” 

Few of these symptom measures were developed with specific minority 
patient populations. Therefore, it is unclear how patient symptom presenta- 
tion varies across cultures or within distinct language domains, and, in turn, 
how cultural expectations might affect the technical performance or in- 
terpretation of scale scores (Manson et al 1985). Thus, there is no a priori 
reason for accepting a preset “caseness” threshold on a particular measure as a 
valid standard for diverse ethnic minority groups. As Rogler et al (1989) 
noted regarding the measurement of psychopathology among Hispanics: “De- 
spite much concern about test bias and misdiagnosis in the psychiatric in- 
terview, research on methods of evaluating mental illness in Hispanics is 
scant, unsystematic, and steeped in experimental confusion” (p. 90). 

More recently the Center for Epidemiologic Studies-Depression measure 
(CES-D) (Radloff & Locke 1986) has become the most popular of these scales 
both because common instrumentation is needed for comparative purposes, 
and because this scale is said to reduce the contaminating effects of physical 
health symptoms that are commonly found on some symptom measures. 
However, eliminating health complaints from symptom checklists introduces 
a potential cultural bias. Minorities disproportionately experience health prob- 
lems because they are disproportionately of low socioeconomic status, and 
they are likely to suffer psychophysiological distress and depressive mood as 
a consequence (Kolody et al 1986). It has also been observed that some ethnic 
minority groups “somatize” psychological problems. According to Kleinman 
(1986), somatization occurs when “individuals experience serious personal 


For example, the symptom checklist most widely used with both national and community 
samples since the 1970s is the 20-item Center for Epidemiological Studies-Depression Scale, or 
CES-D (NCHS 1980). Items cover depressed mood, including feelings of guilt, worthlessness, 
helplessness, and hopelessness; and psychophysiologic manifestations such as loss of appetite 
and sleep disturbance. Each item has a range of four responses indicating how often the 
respondent had felt that way during the past week, coded as follows: 0 = rarely or none of the 
time (<1 day), 1 = some of the time (1—2 days), 2 = occasionally (3—4 days), 3 = most or all of 
the time (5—7 days). The coded values for all 20 items (with positively worded or “nondepressed” 
items reverse-scored) are summed into a total CES-D score, which may range from 0 to 60. 
Scores of 16 or higher are interpreted to mean high levels of depressive symptomatology, or 
“caseness,” because they represent roughly the upper quintile of test scores for patient pop- 
ulations on which the CES-D instrument was originally validated by the National Institute of 
Mental Health. Some sample CES-D items include: “I felt depressed,” “I felt sad,” “I thought my 
life had been a failure,” “I felt hopeful about the future,” “I enjoyed life,” “I had crying spells,” “I 
had trouble keeping my mind on what I was doing.” For a good overview of the properties and 
findings of widely-used symptom checklists—including the GWB (General Well-Being Scale), 
the HOS (Health Opinion Survey), the Langner screening index, and the CES-D—see Link & 
Dohrenwend (1980). 
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and social problems but interpret them and articulate them, and indeed come 
to experience and respond to them, through the medium of the body" (p. 51). 
Another dimension of somatization is the amplification of pain and other 
bodily sensations via cognitive or mood changes. The potential result may be 
an undercount of mental health problems, especially depression, among US 
minority groups. 

Symptom checklists are not designed to mimic the unique symptom con- 
figurations of discrete psychiatric disorders. For example, they could not be 
used for case ascertainment of schizophrenia or major depression. Therefore, 
until quite recently, case ascertainment could only be done via a clinical 
psychiatric interview, and even these procedures seldom achieved more than a 
modest degree of reliability among raters (Spitzer & Fleiss 1974). However, 
by the late 1970s, a diagnostic protocol suitable for field survey research, 
which could be administered by lay interviewers, had been developed to 
reflect the classification system of the Diagnostic and Statistical Manual-Ill 
of the American Psychiatric Association (Spitzer & Endicott 1978). For 
example, the Diagnostic Interview Schedule (DIS) (Robins et al 1981) is a 
fully structured interview that reduces to a minimum the need or desirability 
for interviewer judgments; it requires the presence of a certain number of 
symptoms and exclusions which are judged within the context of their recen- 
cy, duration, and severity.? Despite the fact that cross-cultural validity is 
dubious for many diagnostic categories, the DIS was used to conduct five 


?A diagnostic protocol such as the DIS requires (a) the presence of specific symptoms in order 
to (D) “trigger” a more complete inventory of symptoms which in tum permits (c) a final diagnosis 
of mental disorder. Operationally, this distinguishes the diagnostic approach from the additive 
symptom count method used in checklists. The example provided below illustrates the process of 
meeting criteria for a DSM-IIIR disorder using the DIS. Following 25 questions (Qs. 13-37) 
covering depressive symptoms, which form one part of the DIS—such as questions 36 and 37 
below—the interviewer is prompted to check whether at least three symptoms have persisted for 
at least one year. If this is the case, an additional criterion (A) tbat taps lifetime sad-blue- 
depressed affect is inquired about, as illustrated by question 38. If both of these requirements are 
met, then more depression items are asked. But if these criteria are not met, then the interviewer 
skips to another section of the DIS that deals with phobias. Responses to questions like 36 and 37 
below are coded in terms of recency (< 2 weeks, < 1 month, < 6 months, < J year, > 1 years). 
The actual diagnosis is made by a computer scoring program. 

Q. 36. Have you ever felt so low you thought of committing suicide? 

Q. 37. Have you ever attempted suicide? 

Q. 38. You said you've had a period of feeling depressed or blue and also said you've had 
some other problems like (LIST ALL 5's in Q's 13-37). Has there ever been a time when the 
feelings of depression and some of these other problems occurred together—that is, within the 
same month? (If YES, go to Q. 40; if NO, ask A:) 

A. If NO: So there's never been a period when you felt sad, blue, or depressed at the same 
time you were having some of these other problems? (If HAS BEEN a period, go to Q. 40; if 
NEVER BEEN, go to Q. 48). 
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large regional field surveys, known as the Epidemiologic Catchment Area 
Program (ECA) (Regier et al 1984), and this measure currently constitutes the 
standard in the field for psychiatric epidemiologists. Nevertheless, symptom 
checklists are commonly employed in health and mental health research 
because they are quick and easy to administer and have certain advantages for 
data analysis purposes, and because they gather a wide spectrum of informa- 
tion about respondents’ psychological states that might otherwise be lost using 
a diagnostic protocol such as the DIS. 

Kleinman & Good (1985) have noted a tendency to overlook variation in 
the expression of mental disorder when developing nosological categories. 
The assumption is that psychiatric symptoms or syndromes are universally 
distributed and uniformly manifested. The implicit etiologic thinking that 
supports a “one-size-fits-all” taxonomy is sympathetic to the “official sci- 
ence” of funding agencies such as the NIMH, who currently favor supporting 
research in neuropathology and genetics to explain psychiatric disorders. 
Simply put, if the genesis of psychiatric disorder is rooted in molecular 
biology which is universal, cultural and social factors diminish in significance 
(Fabrega 1991, Vega & Murphy 1990). 

Presently, a law of the instrument prevails in mental health research. 
Checklists and diagnostic protocols are invented, surveys are conducted, and 
the data are presented. Even in the glaring absence of culturally grounded 
validity studies, the very existence of data tends to objectify the protocol used 
to gather the information in the first place. Moreover, the use of statistical 
procedures for establishing internal consistency of scale items serves to 
mollify and obscure concerns about the construct validity of instruments when 
used in multicultural populations. The result is an epidemiologic research 
literature that reflects a general disregard for both the cultural content and the 
social context of mental health problems. 


COMPARISONS OF MINORITY GROUP RATES 


How best to reliably measure mental health or disorder remains obscure. 
Nonetheless, the pragmatic course is to identify the prevalence rates found 
in existing field studies and to examine how rates vary with other factors 
that were also measured in these studies. To make this comparison manage- 
able within space limitations, only selected studies reported in the last two 
decades are included. As seen in Table 2, these studies have used a wide 
variety of symptom checklists for making prevalence estimates. In addition, 
as Table 3 shows, some recent and exceptionally well-funded studies have 
been able to use diagnostic protocols. Studies reporting rates of diagnostic 
disorders are here limited to major depression (as measured by the DIS), 
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Table 2 Summary of contemporary research on symptom levels among US ethnic minorities 


Sample 
Study site and ethnic groups (N) Measure Mean 
United States 2867  CES-D 
Non-Hispanic Whites 2625 8.4 
Blacks 242 10.9 
Los Angeles, California 934  CES-D 
Non-Hispanic Whites 609 8.5 
Blacks 124 10.5 
Hispanics 201 10.6 
Norfolk, Virginia CES-D 
Blacks 1018 13.2 
Alameda, California 528 | CES-D 
Non-Hispanic Whites 219 NA 
Blacks 187 NA 
Mexican-Americans 122 NA 
Southwestern U.S. CES-D 
Mexican-Americans: 3555 
Males 6.3 
Females 9.3 
Miami, Florida CES-D 
Cuban-Americans: 857 
Males 3.7 
Females 7.0 
Seattle, Washington CES-D 
Asian-Americans: 499 9.4 
Chinese: 122 6.9 
Males 6.5 
Females 7.2 
Japanese: 129 7.3 
Males 8.0 
Females 6.7 
Filipinos: 128 9.7 
Males 9.1 
Females 10.5 
Koreans: 105 14.4 
Males 14.9 
Females 13.6 
San Francisco, California CES-D 
Chinese: 360 11.6 
Males i 10.2 
Females 12.8 
Chicago, Illinois CES-D 
Koreans: 622 12.6 
Males 12.3 
Females 12.9 
United States CES-D 


American Indians NA NA 


Caseness 
rate 


15.8 
23.0 


15.6 
21.8 
27.4 


34.5 


14.6 
18.1 
28.9 


8.0 
18.7 


4.1 
14.8 


19.1 
NA 
NA 
NA 
NA 
NA 
NA 
NA 
NA 
NA 
NA 
NA 
NA 


24.2 
NA 
NA 


NA 
NA 
NA 


48.0 


Investigator 


Eaton & Kessler 1981 


Frerichs et al 1981 


Gary et al 1989 


Vernon & Roberts 19 


Moscicki et al 1989 


Narrow et al 1990 


Kuo 1984 


Ying 1988 


Hurh & Kim 1988 


NCAIANMHR 1989a 
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Table 2 (continued) 


Sample Caseness 
Study site and ethnic groups (N) Measure Mean rate Investigator 
Southeast United States CES-D Manson et al 1991 
American Indians NA NA 58.0 
(ages 12-20) 
United States GWB Fazio 1977 
General Population: 6913 80.3 25.9 
Males NA 20.1 
Females NA 30.8 
San Diego, California 500 GWB Rumbaut 1985 
Time 1 
Vietnamese: 157 61.0 71.8 
Males 61.5 70.6 
Females 60.3 73.2 
Chinese-Vietnamese: 114 64.8 67.3 
Males 67.1 58.2 
Females 62.5 75.9 
Hmong: 109 55.2 87.2 
Males 55.8 ,84.0 
Females 54.7 89.8 
Cambodians: 120 58.1 87.4 
Males 57.9 89.7 
Females 58.3 85.2 
Time 2 Rumbaut 1989 
Vietnamese: 157 71.3 49.4 
Males 74.4 38.8 
Females 67.6 62.0 
Chinese-Vietnamese: 114 66.8 69.0 
Males 69.7 60.0 
Females 64.0 77.6 
Hmong: 109 64.8 64.5 
Males 65.7 63.3 
Females 64.1 65.5 
Cambodians: 120 60.6 85.0 
Males 63.1 77.6 
Females 58.4 91.9 
Santa Clara, California 1684 FHS Scales Meinhardt et al 1985-19 
General Population: NA NA 14.8 
Chinese (non-refugee) 620 NA 9.1 
Chinese (refugee) 120 NA 19.2 
Vietnamese (pre-1975) 199 NA 14.6 
Vietnamese (post-1975) 364 NA 26.6 
Cambodians 378 NA 51.9 
Northern Florida 1633 FHS Scales Warheit et al 1973 
Non-Hispanic Whites: 1267 
Males 14.0 10.8 


Females 17.0 21.1 
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Table 2 (continued) 


Study site and ethnic groups 


Blacks: 
Males 
Females 
Northern Florida 
Non-Hispanic Whites 
Blacks 
Alameda, California 
"Study 11" 
Non-Hispanic Whites 
Mexican-Americans 
“Study 12" 
Non-Hispanic Whites 
Mexican-Americans 
Southern Californía 
Non-Hispanic Whites 
Mexican-Americans 
Santa Clara, California 
Non-Hispanic Whites 
Mexican-American 
(English spk) 
Mexican-American 
(Spanish spk) 
Santa Clara, California 
Non-Hispanic Whites 
Mexican-American 
(English spk) 
Mexican-American 
(Spanish spk) 
Fresno, California 
Mexican-Americans: 
Males (farmworkers) 
Females (farmworkers) 
Chicago, Illinois 
Koreans: 
Males 
Females 
Alaska 
Eskimo children 
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Sample 


(N) 
366 


4176 
3469 
707 


2292 
162 


692 
255 
488 
237 
251 
1188 
637 
330 


221 
1176 
635 
320 


221 


500 


622 


NA 


Measure 


Depression Index 


Depression Index 


Depression Index 


Depression Index 


HOS 


HOS 


HOS 


HOS 





Mean 


18.0 
21.2 


13.6 
16.5 
11.6 


13.0 


13.4 
14.4 


14.0 
14.9 


11.6 
12.4 


16.6 


25.8 


25.9 


28.6 


27.8 
28.4 


29.6 
31.0 


NA 


Caseness 
rate 


20.8 
33.8 


13.9 
26.0 
14.1 


20.2 


14.0 
18.1 


12.2 
15.9 


11.9 
15.5 


27.1 


5.5 


6.6 


15.4 


19.9 
19.2 


NA 
NA 


49.0 


Investigator 


Warheit et al 1985 


Roberts 1981 


Griffith 1985 


Vega et al 1984a 


Vega et al 1985b 


Vega et al 1985 


Hurh & Kim 1988 


Kleinfeld & Bloom 1 


since this is one of the most common psychiatric problems in the United 


States. 


These studies represent a broad overview of research conducted in the 
1970s and 1980s with ethnic minority groups in the United States. There is 
substantial variance in the power of their respective designs, and many studies 
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Table 3 Lifetime prevalence of major depressive disorders among US ethnic minorities 


Sample 
Study site and ethnic groups (N) Measure Caseness rate Investigator 
Northern Florida 2097 DIS Warheit et al 1986 
Non-Hispanic Whites 1647 9.3 
Blacks 450 2.8 
Norfolk, Virginia DIS Gary et al 1989 
Blacks 865 5.5 
United States DIS Robins et al 1984 
New Haven, Connecticut: 3058 
Non-Blacks 2638 6.8 
Blacks 420 5.7 
St. Louis, Missouri: 3004 
Non-Blacks 1846 5.7 
Blacks 1158 4.9 
Baltimore, Maryland: 3481 
Non-Blacks 2299 i 3.8 
Blacks 1182 3.7 
Los Angeles, California 2552 DIS Karno et al 1987 
Non-Hispanic Whites 1309 8.4 
Mexican-Americans 1243 4.9 
Los Angeles, California DIS Burnam et al 1987 
Mexican-Americans: 1244 
Immigrants 707 3.3 
U.S.-born 537 : 6.9 
Miami, Florida DIS Narrow et al 1990 
Cuban-Americans: 857 
Males 2.4 
Females 3:7 
United States DIS Moscicki et al 1987 
Miami, Florida: 
Cuban-Americans 902 3.9 
Southwest: 
Mexican-Americans 3555 4.2 
New York City: 
Puerto Ricans 1343 8.9 
Puerto Rico DIS Canino et al 1987 
Puerto Ricans 1551 4.6 





have been conducted with rather small or otherwise idiosyncratic samples, 
thereby limiting their generalizability. The reader is cautioned that some of 
the symptom studies are not directly comparable because of dissimilar in- 
strumentation and design, or diverse criteria used to designate “caseness.” To 
make the review of findings parsimonious, a discussion of particular groups is 
presented in sequence under four broad racial-ethnic categories: blacks, His- 
panics, Asians, and American Indians. 
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Black Mental Health Status 


Our knowledge regarding the prevalence of mental disorders among persons 
of African-American heritage in the United States has passed through three 
stages: anecdotal accounts; information from treatment records; and data from 
field surveys of the general population that used statistical probability sam- 
pling procedures. Only a brief review of the first two sources will be de- 
scribed inasmuch as they have serious methodological shortcomings which 
are self-evident. A more detailed description of findings produced by 
epidemiologic surveys of the general population is then presented, since these 
findings represent the most scientifically defensible information presently 
available. 

The anecdotal accounts of mental disorders among blacks that found their 
way into the literature were often a blend of myth and stereotype, and they 
sometimes reflected an implicit racism on the part of those doing the report- 
ing. Prior to the establishment of Community Mental Health Centers as 
provided by federal legislation in 1963, few blacks received mental health 
treatment in noninstitutional settings and, moreover, a great many states, 
especially those located in the deep south, had only a token number of 
state-operated psychiatric beds set aside for the treatment of blacks. The 
absence of blacks in treatment during the first three decades of this century led 
a number of writers to conclude, erroneously, that blacks had lower rates of 
mental disorder than whites inasmuch as prevalence estimates were based 
almost entirely on data provided by state and county governments. Moreover, 
some writers alleged that the absence of blacks in treatment in public facilities 
was occasioned by a trouble-free existence and/or the special care and 
supervision they received as slaves (Witmer 1891, Babcock 1894, O'Malley 
1914, Powell 1896). 

The first systematic information on the mental health status of Blacks at a 
national level was derived from the decennial censuses and from the reports of 
treatment from the various states. It is worth noting in passing, however, that 
until 1970 the US Bureau of the Census did not have a separate classification 
for blacks. Prior to this period, they were included as part of a “nonwhite,” 
catch-all category. In 1930, the Census Bureau collected information, for the 
first time, on mental health treatment patterns in state hospitals throughout the 
United States. Nonwhites were found to have higher rates of "insanity" than 
whites in 1930 and in every census report since then. 

A number of investigators, most notably Malzberg, recognized the con- 
taminating influences of the differential access to psychiatric care in various 
regions of the United States. However, in reporting his own findings, he 
dismissed this source of error and claimed that racial discrimination did not 
exist in New York's treatment of different races and, therefore, that his 
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Malzberg went on to publish scores of reports which consistently showed 
higher rates of mental illness among blacks than whites, and his findings were 
often accepted by others as being accurate. No critical scientist today would 
give credence to Malzberg's findings as representing an accurate assessment 
of the true prevalence of mental disorders among blacks or whites. His data, 
of course, were at best representative only of those persons in treatment in 
New York state mental hospitals. Nonetheless, it is important to note that the 
findings reported by Malzberg were consistent with those of others using 
treatment data. For example, Faris & Dunham (1939), in their classic early 
study based on treatment rates in Chicago during the late 1920s and early 
1930s, found that blacks had higher rates of treated disorders in some 
categories but lower rates than whites in others. Again, these findings, like 
those of others relying on treatment rates, do not offer true prevalence 
estimates. : 

The epidemiologic field survey approach is obviously superior to anecdotal 
and treatment methods because it provides information on a representative 
sample of the general population. Still, the two most widely cited early field 
surveys of the general population, the Midtown Manhattan and the Stirling 
County studies, did not contain enough blacks in their samples to establish 
caseness or impairment rates for this population. Moreover, of the eight 
studies later cited by Dohrenwend & Dohrenwend (1969) that included 
subsamples of blacks and whites, half of them reported higher rates for blacks 
while the other half reported higher rates for whites. 

The first major epidemiologic field survey to contain a large enough sample 
of blacks to make cross-racial comparisons was conducted by the National 
Center for Health Statistics (NCHS). The Cycle 1 Health Examination Survey 
was conducted in 1960-1962 using a probability sample of 7710 adults aged 
18-79. The interview schedule included questions about feelings of nervous 
breakdown, being fidgety and tense, hands trembling, trouble sleeping, and 
so forth. Whites were found to have significantly higher mean score values 
than blacks, and this was the case for both males and females (NCHS 1970). 

A second major national survey conducted by the NCHS included data on 
mental health symptoms collected during 1971—1975 as part of the Health and 
Nutrition Examination Survey (HANES) of 6913 adults aged 25-74. In this 
initial HANES study, two new symptom scales were introduced: the CES-D 
and the General Well-Being Index (GWB). Blacks were found to have 
significantly higher distress scores than non-Hispanic whites on the 20-item 
CES-D, and lower well-being scores on the 18-item GWB. For example, the 
mean CES-D score for Blacks was 11.1 and for whites it was 8.4 (NCHS 
1980). 

The first large epidemiologic field survey to obtain extensive psychiatric 
data on blacks and whites living in the same geographic area was conducted 
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by Schwab & Warheit in north central Florida in the early 1970s. As reported 
by Warheit et al (1973, 1975), blacks were found to have higher rates of 
psychiatric symptoms and dysfunctions than whites on a number of scales— 
e.g. anxiety, depression, phobias, etc—but most of these differences were not 
statistically significant when the data were analyzed using regression pro- 
cedures and other multivariate techniques that controlled for education, occu- 
pational status and family income. 

The most comprehensive epidemiologic field surveys ever conducted in the 
United States were part of the Epidemiological Catchment Area projects 
funded under the auspices of the NIMH. Altogether, these five studies (in 
New Haven, Connecticut; Baltimore, Maryland; St. Louis, Missouri; the 
Piedmont area of North Carolina; and Los Angeles, California) secured 
information by making diagnostic judgments on 18,571 adults using the DIS. 
Of this total, 4,287 were non-Hispanic blacks. The general pattern, as seen in 
Table 3, is that blacks have similar or slightly lower rates of lifetime major 
depressive disorder compared to non-Hispanic whites, as measured by the 
DIS. To date, however, few detailed comparisons have been published about 
prevalence patterns beyond those reported by Robins et al (1984) and Somer- 
vell et al (1989). 

Between 1983 and 1986 Warheit and his colleagues conducted another field 
survey using the DIS in a north-central Florida SMSA of approximately 
200,000 persons. As part of this study, they obtained the information neces- 
sary to make DSM-IIIR diagnoses of alcohol abuse dependence and major 
depressive episodes. Resembling the patterns reported by the ECA program, 
it was found that 9.3% of non-Hispanic whites and 2.7% of blacks met criteria 
for a diagnosis of major depressive disorder in their lifetime (Warheit & Auth 
1986). Paradoxically, Blacks in this study—who were also administered the 
CES-D—actually had significantly higher levels of depressive symptoms on 
the CES-D checklist, but significantly lower rates of lifetime major depres- 
sion as measured by the DIS (Warheit 1990, personal communication). These 
results appear to be an artifact of both the diagnostic logic employed in the 
DIS? and the differential effects of recall between white and black respon- 
dents. That is, lower-SES blacks in this study were very unlikely to meet 
criteria for lifetime major depression based on recalled symptoms occurring 
more than one year before the DIS interview, whereas most whites who met 
the same caseness criteria reported depressive episodes that had occurred 
more than one year before the DIS interview. 

The results of symptom studies over the previous two decades continue to 
be inconsistent. Some symptom studies report higher mean scores and case- 
ness rates for Blacks than for non-Hispanic whites or Hispanics. But other 
variations are also reported, such as that blacks have lower symptom levels 
than Mexican-Americans or non-Hispanic whites. The study by Gary et al 
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(1989) reports extraordinarily high mean scores on the CES-D, and over one 
third of blacks met empirically defined criteria for caseness on this 
nondiagnostic scale. These differences in findings are not easily resolved. 
They could be real, reflecting variations in samples and locations, or they 
could be artifactual, a product of design bias or measurement error. 

The patterning of symptoms by sociodemographic variables is similar to 
those reported commonly in the epidemiologic literature (Mirosky & Ross 
1989). A curvilinear relationship between age and symptom levels is found, 
and women generally have higher symptoms levels than males. However, 
these gender differences are contradicted in a few studies, where no differ- 
ences in adjusted rates are found between black men and women. Most 
studies describe an inverse relationship between education and depressive 
symptom levels. However, some studies have reported that it is not the lowest 
educational stratum, but the second to lowest stratum (secondary school level) 
that have the highest symptom levels. 


Hispanic Mental Health Status 


Contemporary understanding about Hispanic mental health emerges from 
diverse strands of research. A body of ethnographic research was developed 
in the 1950s and early 1960s depicting Mexican-American family dynamics 
and sociocultural practices (Saunders 1954, Madsen 1964). The collective 
picture that emerged from this research, which was mostly conducted in South 
Texas, was that extended family cohesiveness and traditional values were 
believed responsible for a lower prevalence rate of mental health problems. 
Mexican culture provided a cushion for absorbing the rude shocks of life. 
Other findings described deeply held supernatural belief systems and a well- 
articulated system of natural healers that operated as a functional alternative to 
traditional psychiatric services among Mexican Americans. In later years, 
similar research emerged about the role of folk beliefs and folk providers 
among Puerto Ricans (Ruiz & Langrod 1976) and Cuban Americans (Sando- 
val 1979) as well. 

Among the first rates-in-treatment studies available is the older work by 
Jaco (1960), which indicated that Mexican Americans had lower rates of 
admission to Texas mental hospitals than did non-Hispanic whites. The lower 
utilization of services by Mexican Americans was reported in biometry data 
from many sites in the Southwestern United States, stimulating a controversy 
about whether this situation was due to a lower prevalence of psychiatric 
disorders, or merely organizational barriers to services. Mexican culture was 
believed to somehow buffer Mexican Americans from stressors and to make 
them more tolerant of deviant behavior among family members. 

However, some later ethnographic studies in Los Angeles disconfirmed the 
notion that all Mexican Americans relied on natural healers and folk beliefs to 
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deal with mental health problems (Edgerton et al 1970). That research in- 
dicated that Mexican Americans were similar to non-Hispanic whites in their 
ability to accurately distinguish psychiatric problems from somatic disorders 
using short case vignettes, but that they were likely to indicate that physicians 
were the providers of choice for mental health problems (Karno & Edgerton 
1969). This finding was bolstered by reports by Mexican-American respon- 
dents, especially the least acculturated ones, that they viewed psychiatric 
disorders as biologically based. This research was and remains important 
because it demonstrated that even within the same ethnic group, Mexican 
Americans, wide variations in cultural practices are found. These differences 
could be occasioned by regional cultural patterns, urban-rural or SES differ- 
ences, cohort differences, recency of migration, or other factors. It is prudent 
to add that many of these issues remain unresolved, but the advent of 
widespread epidemiologic surveys in the 1970s provided an opportunity to 
move away from conjectures about prevalence based on ethnographic and 
rates-in-treatment studies. 

The results of studies reported in Tables 2 and 3 are interesting but 
inconclusive. For example, for Mexican Americans some studies show higher 
symptom rates than for non-Hispanic whites or Blacks. But another study, the 
recent Hispanic Health and Nutrition Examination Survey (HHANES) re- 
ported by Moscicki et al (1989), which has an exceptionally large regional 
sample drawn from the Southwestern United States, reports low symptom 
levels. Furthermore, results vary even among those studies which used the 
same symptom checklist, the CES-D, for making estimates. This variation 
could be a methodological artifact, involving administration mode or sam- 
pling effects, or it could indicate real differences in the populations sampled. 
On the other hand, the Los Angeles ECA study reported lower rates of major 
depression among Mexican Americans than among non-Hispanic whites in 
the same survey (see Table 3). Very importantly, among Mexican Americans, 
Burnam et al (1987) reported rates of lifetime major depression for immi- 
grants that are significantly lower than those for the US-born. 

Puerto Ricans in New York reported high symptom levels on the CES-D, as 
well as high rates of DIS-major depressive disorder on the HHANES (Narrow 
et al 1991). These mainland findings are in stark contradiction to the lower 
rates of major depression reported from the island of Puerto Rico, which 
generally conform to patterns reported for non-Hispanic whites, blacks and 
Mexican Americans. Given that the studies in both sites included large 
samples with careful attention to design effects, the New York-Puerto Rico 
comparison is likely to underscore real differences in culturally similar pop- 
ulations; it provides a provocative target for future research aimed at es- 
tablishing links between social ecology and mental health. 

Cubans in Miami were found to have significantly lower rates of depressive 


MINORITY MENTAL HEALTH 369 


symptoms and major depression when compared to other Hispanic ethnic 
groups in the HHANES (Moscicki et al 1987, Narrow et al 1990). Although 
this substudy of the HHANES suffers from small sample size and very high 
refusal rates, the results suggest that Cubans are at low risk for depressive 
symptoms and major depression. However, it should be emphasized that in 
the case of both Puerto Ricans in New York and Cubans in Miami, no 
comparative data are available from other continental US sites. 

Among Hispanics, the rates of major depression are similar for Mexican 
Americans in the United States and Puerto Ricans on the island. On the other 
hand, the highest rates of major depressive disorder are found among Puerto 
Ricans residing in New York, and the lowest rates are reported by Cubans in 
Miami and by Mexican immigrants in Los Angeles. Since approximately 8096 
of the Cubans in the HHANES were immigrants, these two studies reveal an 
important counter-intuitive finding: immigrants from these two ethnic groups 
appear to experience lower vulnerability for major depression than do non- 
Hispanic whites (the overwhelming majority of whom are nonimmigrants). It 
is also worth noting that both symptom levels and diagnostic rates among 
Hispanic ethnic groups assessed in the HHANES mirror precisely the rank 
order of group differences in socioeconomic status presented above in Table 
1. 

Although this information is not presented on Table 2, the relationship of 
sociodemographic variables to symptoms in these Hispanic studies is general- 
ly in accord with the broader epidemiologic literature in reporting universally 
higher rates for women than for men (see Mirowsky & Ross 1989). These 
studies also consistently report an inverse correlation between income/ 
education and symptom levels. Similarly, those in disrupted marital statuses 
and the unemployed have the highest symptom levels. Age effects on symp- 
toms are curvilinear, being highest in late adolescence and early adulthood 
and rising again in late middle age and beyond. 


Asian Mental Health Status 


Our knowledge of the mental health of Asian-Americans is of much more 
recent vintage—and is much less developed—than that reviewed above for 
blacks and Hispanics; in part this reflects the smaller size and more recent 
immigration of many Asian ethnic groups. To date no national-level surveys 
have been carried out. Indeed, as Table 1 shows, the majority of Asian- 
Americans in the United States today are newcomers who immigrated after 
racist laws barring Asians were rescinded in the late 1960s. Historically, the 
oldest Asian-origin group had been the Chinese, who began immigrating to 
California in the 1850s and were excluded by federal law in 1882. Their place 
was taken by the Japanese, who were themselves restricted by the “Gentle- 
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men's Agreement" of 1907, and then by Filipinos, who until the 1930s were 
recruited to work in plantations in Hawaii and California after the American 
colonization of the Philippines. By the early twentieth century, amid agitation 
for immigration restrictions, anecdotal accounts by scientific experts pointed 
to the inferior psychological makeup of Asian immigrants and to the “remark- 
able tendency to suicide" among the Japanese in California (Sanua 1970). 

Later rates-in-treatment studies of first admissions of Asian-Americans to 
state mental hospitals carried an opposite implication: Chinese, Japanese, and 
Filipino patients were grossly underrepresented relative to the size of their 
respective populations. For example, in California state hospitals during the 
early 1960s, the Chinese and Japanese had rates of admissions per 100,000 
population that were one half to one third, respectively, of the rate for 
Caucasians, and lower still than the rate for blacks; and in the late 1960s, 
similar data were reported for Chinese, Japanese, and Filipino patients admit- 
ted to Hawaii state mental hospitals (Sue & Morishima 1982). Several studies 
during the 1970s sought to explain this pattern of “underutilization” of mental 
health services by Asian-Americans, pointing to various cultural and structur- 
al factors (Sue & Morishima 1982). Still, by the beginning of the 1980s, in 
The Mental Health of Asian Americans, Sue & Morishima (1982) could find 
not a single generalizable epidemiologic survey in the research literature that 
reported prevalence rates of psychological distress based on standardized 
instruments with community samples. The available data consisted instead of 
a handful of reports of personality tests administered to small samples of 
Asian-origin college students in California and Hawaii. During the 1980s this 
general picture changed significantly, however, and the results are summa- 
rized in Table 2. 

The first community study of the prevalence of depression among Asian- 
Americans was reported by Kuo (1984). The CES-D was administered to a 
Seattle sample comprising four ethnic groups: Chinese, Japanese, Filipinos, 
and Koreans. (The high refusal rate for the Filipinos in this study was 
particularly unfortunate because, although the Filipinos now form the largest 
Asian-origin ethnic group in the United States, Kuo's research remains the 
only such report to date on their mental health.) A factor analysis of CES-D 
items showed a factor structure among the East Asians (Chinese, Japanese, 
Koreans) similar to that found for Anglo samples, but not for the Filipinos; 
that is, the Filipinos' pattern of expression of depression differed from that of 
the East Asians. For the sample as a whole, the mean CES-D score was 9.4 
(slightly above the 8.4 rate reported for non-Hispanic whites in the United 
States), and the caseness rate was 19.1% (again slightly above the norm for 
non-Hispanic whites). However, there were significant differences in mean 
scores among the ethnic groups: The rate for Koreans (14.4), nearly all of 
whom were recently arrived immigrants, was twice that of the Chinese (6.9) 
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and Japanese (7.3), with the Filipinos falling in between (9.7). Bivariate 
analyses showed no common patterns among the groups in the relationship of 
symptoms to demographic, nativity, or SES variables. Indeed, the differences 
in depression scores among the four ethnic groups did not wash out after 
controlling for all such variables. In addition to ethnicity, only the effects on 
depression of employment and age (with younger respondents having signifi- 
cantly higher depression scores) tended to hold in a multivariate analysis. 

A later CES-D telephone survey was carried out by Ying (1988) with a 
sample of Chinese-Americans in San Francisco. The mean CES-D score for 
this sample was 11.6 (significantly higher than the score for the Chinese 
group in the Seattle study); the caseness rate was 24.296. Lower depression 
Scores were observed for men, for the US-born and immigrants with longer 
residence in the United States, and for higher-SES respondents. Marital status 
was not significantly associated with CES-D scores, perhaps because un- 
married respondents reported many close and supportive contacts with their 
family of origin. When education and occupation variables were controlled in 
a multiple classification analysis, gender differences in depression were 
reduced, and the effects of nativity and length of time in the United States 
were eliminated, although (younger) age emerged as a significant predictor. 

A much more comprehensive study of Korean immigrants in Chicago has 
recently been reported by Hurh & Kim (1988), who used several mental 
health measures (CES-D, HOS, and a life satisfaction scale). Mean CES-D 
scores were 12.3 for males and 12.9 for females. Sociodemographic corre- 
lates of depressive symptoms were generally in accord with the patterns 
reported in the larger epidemiologic literature. Work-related variables best 
accounted for the mental health of male respondents, but their effect was 
much weaker for females. Interestingly, with respect to modes of accultura- 
tion, bicultural strategies (reading both Korean and American newspapers 
regularly, associating with both Korean and American friends) were associ- 
ated with lower depression and higher life satisfaction than was the case for 
any type of monocultural strategy (Americanization, Koreanness). Such addi- 
tive adaptation was related more strongly to the mental health of female than 
of male respondents. Similar results showing a positive effect of bicultural 
strategies (rather than assimilationist or traditionalist attitudes) on both de- 
pression and life satisfaction have been reported among Southeast Asian 
refugees (Rumbaut 1991). 

In Santa Clara County ("Silicon Valley") in 1983, Meinhardt et al (1985— 
1986) compared the mental health needs of the general population against 
those of Southeast Asian refugees (Cambodians, Vietnamese, ethnic Chinese) 
and nonrefugee Chinese (both US-born and recent immigrants from China, 
Taiwan, and Hong Kong). The study employed four FHS scales (measuring 
depression, anxiety, cognitive impairment, psychosocial dysfunction) which 
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had been developed and validated by Warheit and his colleagues in Florida in 
the 1970s (see Table 2). All scores on these scales that were at least one 
standard deviation above the mean for the general population sample were 
defined as high scores; in turn, persons with high scores on all four scales 
were classified as being in “high need” for mental health services, and high 
scores on two or three scales were defined as indicative of “moderate need.” 
By this method, 14.8% of Santa Clara County’s general population sample 
were found to have either high or moderate mental health needs. Better- 
educated, "first-wave" Vietnamese refugees who arrived in the United States 
in 1975 matched this level of need (14.6%); and the non-refugee Chinese 
(over half of whom were immigrants employed in professional or managerial 
occupations) had notably lower mental health needs (9.1%). In sharp contrast, 
51.9% of the Cambodians exhibited high or moderate mental health needs, 
reflecting not only their low-SES background but also the fact that they 
experienced far more traumatic prearrival experiences than did other refugees. 
The post-1975 Vietnamese and ethnic Chinese “boat people” fell between 
these extremes, with mental health needs levels of 26.6% and 19.2%, respec- 
tively. 

These findings on the mental health of Indochinese groups were confirmed 
by a longitudinal study carried out in San Diego County at about the same 
time (Rumbaut 1985, 1989). The refugees’ mental health was assessed by 
several measures, including the General Well-Being Index (GWB) which had _ 
been used with a national sample in the first Health and Nutrition Examination 
Survey (HANES) survey of the American population. The HANES data 
showed a rate of demoralization among American adults of 25.9% (Dupuy 
1974, Fazio 1977, Link & Dohrenwend 1980). The corresponding rate for 
Indochinese refugees in San Diego was three times higher (77.9%) in 1983; a 
year later the rate had declined noticeably but was still very high (65.7%). As 
Table 2 shows, considerable variation appeared among the various In- 
dochinese ethnic groups: the Cambodians had the highest levels of de- 
moralization, followed by the Hmong, the Chinese-Vietnamese, and the 
Vietnamese. Depressive symptoms were socially patterned, confirming re- 
search elsewhere: higher levels were found among women and low-SES 
groups, lower levels among married respondents and those with more rela- 
tives and co-ethnic friends nearby, and the association of symptoms with age 
was curvilinear. On the other hand, this study also found that the refugees’ 
degree of reported satisfaction with different areas of life was not necessarily 
inversely related to their degree of depression; e.g. the Cambodians exhibited 
at once the highest levels of depression and the highest levels of life satisfac- 
tion. Measures of life satisfaction tap into cognitive appraisals or “definitions 
of the situation” which are made by individuals relative to their expectations 
and aspirations; as such, they differ in kind from the affective dimensions of 
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mental health (such as depressed mood, anxiety) that are more often measured 
by symptom scales. 


American Indian Mental Health Status 


Little survey information is available about the mental health status of Amer- 
ican Indians and Alaska Natives in tribal or community populations (Manson 
et al 1987). Recent surveys have used adolescent and adult student samples. 
No doubt the convenience of captive populations within institutional settings 
is the reason for this situation; however, it certainly imposes serious limita- 
tions on the generalizability of current prevalence estimates to adult American 
Indian populations. A summary of these studies can be found in Table 2. 

Only two of these surveys used the CES-D, and both reported very high 
caseness rates using the standard criterion: 58% (Manson et al 1991) and 48% 
(NCAIANMHR 19892), respectively. These are far higher CES-D rates than 
those reported for any other US ethnic minority group listed in Table 2. An 
earlier study by Kleinfeld & Bloom (1977) of Eskimo children in Alaska 
boarding schools, using the HOS, reported a caseness rate of 49975— more 
than twice the rate found among Mexican-American farmworkers in Califor- 
nia and nine times higher than the rate for non-Hispanic whites (see Vega et al 
1985). The highest estimates of 58% (CES-D) and 49% (Health Opinion 
Survey) are for school samples, representing the only studies of adolescents 
listed in Table 2; as such, they should be placed in the context of the broader 
epidemiologic literature, which has consistently found higher symptom rates 
among adolescents and young adults than among middle aged adults (Mirow- 
sky & Ross 1989). 

There are also studies of three student samples reporting that close to one 
half of respondents were suffering from a "phobic reaction" (NCAIANMHR 
1989b). These students included both boarding school students and college 
students in state-supplement universities. Vital statistics data about age- 
specific suicide rates suggest that American Indian children between 10 and 
19 years of age are about three times more likely to commit suicide than are 
other US children. Moreover, data from the Indian Adolescent Health Survey 
indicate that 1896 of males and 2396 of females have had a family member 
attempt or commit suicide. These are clear signs of extraordinary personal 
disorganization and distress, and available information suggests the situation 
is widespread, including tribal groups throughout the United States. 


STRESS AND RELATED ANALYTIC ISSUES 


Perhaps the most intriguing question surrounding the study of minority mental 
health is whether there are specific characteristics inherent in minority status 
that increase risk for psychiatric symptoms and disorders. This type of 
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research is generally guided by social psychological stress theory, which 
presupposes that certain types of stressors. fall disproportionately on certain 
sectors of the population, especially those 2xperiencing more life changes but 
having fewer resources available to cope with them (Pearlin et al 1981). 
Factors such as low socioeconomic statuz, marital disruption, negative life 
events, chronic strain, and low social support are commonly measured in 
these studies as predictors of mental heelth problems, such as depressive 
symptoms. For years it has been conjectured that minority status was itself a 
stressor, independent of the usual socioeccnomic and demographic predictors 
of mental health problems. 

In general, there are really two categories of stressors to which ethnic 
minorities are subjected that may operate as sources of vulnerability. The first 
consists of subjective factors, such as perceptions of unfair treatment or 
blocked opportunity. The second consists of objective statuses and negative 
life events such as unemployment, disrupted marital relationships, frag- 
mented social networks, and physical hardships. In actuality, the two are 
associated because perceptions can result from real life experiences and, in 
turn, perceptions can help shape future life experience. For example, perceiv- 
ing a prejudiced environment is more lixely to influence an individual to 
reside and form social ties within a dense enclave of ethnically similar 
individuals, which can then have repercussions for social mobility. 

From a sociological perspective, the discontinuity of role relationships and 
role strains seems especially pertinent for ‘he study of minority mental health 
(Goode 1960). Structural limitations that ozcur as the result of discrimination, 
limit access to valued social roles, or constrain successful fulfillment of 
ordinary role requirements such as that of parent or spouse are more likely to 
shape the lot of racial minorities. Sociologists have hypothesized that the 
unavailability of means for acquiring va‘ued goals will result in patterned 
behavioral responses among goal strivere which can have destructive con- 
sequences for individuals (Parker & Kleirman 1966). Minority group mem- 
bers who persist in reacting to a racis system may be psychologically 
vulnerable if they are unsuccessful in their efforts (Neighbors 1987). 

Special categories of minority groups scch as refugees are especially likely 
to experience role transitions that dramatically alter the meaning of life. Role 
conflicts can occur in situations of immigrant acculturation where there are 
multiple. definitions of appropriate role 5ehavior, leading to inter- or in- 
tragenerational competition for allegiance -rom family and peers (Rumbaut & 
Rumbaut 1976). Role inconsistencies occur to immigrants who must start 
over again in an occupation of lower prestige than the one thev occupied in 
their nation of origin. Research which has sought to conceptualize and 
measure indicators of these processes, or their effects, is relatively rare and of 
recent origin (see Hurh & Kim 1988, Rumbaut 1991). 
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Among foreign-born minorities in the United States, the distinction typical- 
ly made between "immigrants" and "refugees," while problematic, has fo- 
cused attention on different contexts of expulsion and thus introduced an 
important new variable in our understanding of minority mental health (Portes 
& Rumbaut 1990). Long-distance journeys entail a set of stressful life events 
that, though varying widely in kind and degree, can produce profound 
psychological distress, even among the best prepared and most motivated and 
even under the most receptive circumstances. Both refugees and immigrants 
must cope with a significant amount of life change, but "refugees" tend to 
experience more threat, more undesirable change, and less control over the 
events that define their contexts of exit. The research literature on refugees 
recently arrived from war-torn regions has underscored this point, particularly 
among Cambodian survivors of the "killing fields" of the late 1970s 
(Meinhardt et al 1985-1986, Rumbaut 1985). A comparative study of In- 
dochinese refugees in San Diego found that contexts of exit largely accounted 
for the considerable variance in depressive symptoms among the various 
ethnic groups (highest for Cambodians and Hmong, lowest for ethnic Chinese 
and Vietnamese); however, a follow-up survey a year later found that the 
effects of such past experiences on the refugees' mental health tend to 
diminish, while those associated with their present circumstances (especially 
socioeconomic factors) become increasingly important (Rumbaut 1989, 
1991). 

Some key studies, such as the ones by Warheit and his colleagues, have 
found no differences in symptom mean scores between non-Hispanic whites 
and blacks after controlling for SES. Nor have blacks reported consistently 
higher rates of either symptoms or disorders than non-Hispanic whites. 
Nevertheless, Kessler (1979) notes that "socially disadvantaged persons will 
be both more highly exposed to stressful experiences and also more highly 
influenced by stressful experiences than socially advantaged persons" (p. 
259). In this same research study, Kessler found that nonwhites were twice as 
likely as non-Hispanic whites (28% vs 14%) to report “extreme distress" as a 
function of experiencing more stressful situations. Kessler & Neighbors 
(1986), in comparing blacks and non-Hispanic whites, only found a differ- 
ence in psychological distress level among low-SES blacks. Therefore, race 
and income interacted as explanatory variables in predicting psychological 
distress, but only for the lowest socioeconomic strata. A recent study by 
Ulbrich, Warheit, and Zimmerman (1989) partially confirms the Kessler and 
Neighbors findings. Low-SES blacks were more vulnerable than low-SES 
non-Hispanic whites to the impact of undesirable noneconomic events, but 
less vulnerable than non-Hispanic whites to the impact of economic problems. 

Some limited evidence supports the view that minority status can be 
tentatively considered as a proxy for life stress. Nevertheless, a more com- 
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prehensive understanding of the stress aad coping process among ethnic 
minorities will require. nonrecursive stress process models and longitudinal 
designs to isolate the factors that contribut2 to mental health problems in the 
context of person-environment interactioms. The fact that the variable of 
"race" explains a few percentage points of variance in a linear analysis does 
not really enlighten us about underlying processes that make race salient as a 
predictor of psychological distress. Nor can we assume that SES is easily 
disentangled from race, since both are ind cators for other structural factors, 
including racial discrimination. 


Ethnic Minorities, Acculturation, Anomie, and Mental Health 


Durkheim alluded to the existential component of the human spirit when he 
noted that social areas undergoing rapid social change were more likely to 
produce a disorganizing effect on the personal functioning of individuals 
(Durkheim 1951). In his classic study, suicide was used as a valid indicator of 
environmental impact on individuals. The original construct of anomie con- 
veyed the idea of increasing availability of behavioral choices, and a weaken- 
ing of social controls, which potentially overwhelmed individuals who had 
been reared in a milieu characterized by well-defined, stable social ex- 
pectations. US ethnic minorities experienze several variations on a similar 
theme. Some ethnic minority members are: born and reared in social environ- 
ments that are culturally diverse. Others, such as immigrants and refugees, 
proceed through a transitional process moe akin to that described by Durk- 
heim. That is, they proceed from a stable calture of origin which has provided 
a cultural context for their behavior and self-concept, into ancther cultural 
complex that forces them to reorganize their social expectations. 

These are two very different situations, and the psychological con- 
sequences probably are not similar for bot. In the instance of the immigrant, 
important variables intervene such as the match between expectations for 
social incorporation and the actual opportunities available, and the level of 
resistance offered by the host society. Scme evidence suggests that immi- 
grants who are disappointed with their opportunities in American society are 
more likely to feel depressed (Vega et al 1987). There is also research 
showing that minority immigrants who reside in supportive enclaves have 
fewer mental health problems than those who are isolated in nonminority 
residential areas (Kuo 1976). Moreover, immigrants who have lower ex- 
pectations than native-born minorities may not be subject to the same frustra- 
tion when they encounter social resistance or discrimination. Indeed, it may 
be the process of socialization into enhanced expectations, or the perception 
of a “rigged” game, that is personally debi itating. To the extent that a person 
perceives limited opportunity, as well as powerlessness to change this situa- 
tion, it may be expected that psychological distress will result. This would 
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seem to be true across SES lines but is more likely to be evidenced where 
expectations are heightened (i.e. in relative deprivation). 

These issues suggest that concepts such as anomie, powerlessness, and 
alienation are interrelated constructs. Within American sociology, anomie is 
usually linked to environmentally based change, or the placement of in- 
dividuals in environments characterized by contrasting behavior expectations. 
Moreover, alienation and powerlessness are possible personal concomitants 
of anomie. Recent research (Szapocznik et al 1978) has described how the 
impact of multigenerational acculturation among Cuban immigrants creates a 
similar destabilization of normative expectations. Since new behaviors are 
adopted at an uneven rate by family members, family tension is occasionally 
provoked by lessening commitment to traditional family expectations by 
children, especially by young male adolescents. Parents may feel threatened 
that their beliefs are being depreciated and, worse still, even replaced by 
cultural practices of suspect origin or value. Often parents react by rigidly 
imposing their authority, and adolescent boys respond by completely rejecting 
family expectations. This type of cultural gap or dissonance is associated with 
psychopathology among parents as well as drug use and deviance among 
adolescent male children who struggle to gain an acceptable or coherent 
understanding of their social reality. 

In a longitudinal study of Southeast Asian refugees, Rumbaut (1985, 1989) 
has noted that the psychological impact of immigration tends to be con- 
centrated in the first three years after arrival in the United States. There is an 
initial euphoria that characterizes the first year, followed by a strong dis- 
enchantment and demoralization reaction during the second year, with a 
gradual return to early levels of well-being and satisfaction after the third . 
year. This U-shaped curve (from elation to depression to recovery), which 
marks the course of psychological adjustment over time from first arrival to 
three years thereafter, has been observed among other displaced minorities, 
such as Cubans and Eastern Europeans (Portes & Rumbaut 1990). Recovery 
thus appears to occur in a relatively short period of time, with the adjustment 
occurring more rapidly among those of higher social class backgrounds. 
Similar findings were reported by Ying (1988) for Chinese immigrants: those 
with less than one year in the United States had lower CES-D scores, those 
with one to four years of residence in the country had the highest depression 
Scores, and scores declined significantly thereafter. In a cross-sectional study 
of Korean immigrants in Chicago (which did not include persons with less 
than a year of residence in the United States), the highest levels of depressive 
symptoms (measured by both the CES-D and HOS) were found among 
Koreans who had been in the country for one to two years—a period termed 
the "early exigency stage" (Hurh & Kim 1988). After three years, the Korean 
immigrants’ mental health significantly improved and stabilized over time; 
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the rate of increase in well-being tended to stagnate for those who had been in 
the United States ten years or longer, corresponding to perceptions of having 
reached occupational career limits. 

The Los Angeles ECA survey which assessed psychiatric disorders in 
immigrant and native-born Mexican Americans, and non-Hispanic whites, 
discovered that immigrants had the lowest prevalence of psychiatric disorders 
(Burnam et al 1987). Furthermore, US-bern Mexican Americans and non- 
Hispanic whites were much more likely to >e drug abusers. The investigators 
in this research argue that immigrants in tris sample may represent a “resili- 
ent" cohort that is self-selected because of zhe inherent difficulties associated 
with migratory passage and readjustment n another society. However, this 
explanation is unsatisfying on many counts, and much conceptual and empir- 
ical work remains to be done in order to assess more finely the underlying 
factors responsible for these differences, if indeed the differences are valid 
and replicable. 

Perhaps minority immigrants simply uncergo a very different sociological 
experience with correspondingly different psychological ramifications. 
However, it can be expected, and the empirical evidence seems to suggest, 
that children of immigrants or even long-stzy first-generation immigrants who 
are exposed to highly stressful environments, have a tendency to emulate 
patterns of personal vulnerability and family disintegration that characterize 
other US minority groups. This tendency is exacerbated in urban environ- 
ments featuring highly anomic properties, :ncluding cultural diversity, social 
disorganization, rapid social change, and vast disparities in wealth and access 
to valued resources. The New York experience of Puerto Ricans is a case in 
point, where the level of marital disruption almost doubled between 1960 and 
1980, and levels of depressive symptoms aze extremely high (Bean & Tienda 
1987). 


CONCLUDING RECOMMENDATIONS 


In candor, it should be emphasized that space limitations prevented us from 
broaching some of the most important areas related to minority mental health, 
such as service utilization issues and psychiatric and behavioral disorders 
among minority children. Even our coveraze of adult mental health status is 
selective and cursory. Nevertheless, some recommendations emerge from this 
review that could improve the yield of future research. The primary 
recommendation, encompassing the other suggestions listed below, is that 
future studies use comparative designs and common instrumentation in order 
to extend the power of each investigation. Nevertheless, even this type of 
approach requires preliminary validity stadies to assess both the cultural 
grounding of instruments and their psychometric or measurement equivalence 
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within each ethnic group. Moreover, at a minimum, survey instruments that 
seek to operationalize and measure “mental health" need to distinguish clearly 
between somatic symptoms and “affective” (depression, anxiety) and “cogni- 
tive” dimensions (satisfaction with different areas of life, aspirations). Very 
briefly, other recommendations or questions for future research include the 
following: f 

€ Theoretically informed sociological research needs to move beyond the 
usual correlational analyses of reported symptoms and sociodemographic 
variables, to take social and historical contexts fully into account (Ruiz 1990). 
Among immigrant and refugee groups in particular, analyses need to attend to 
the wide diversity in both contexts of exit and contexts of reception and 
incorporation. Theories coined at the turn of the century to describe the 
assimilation process of European immigrants are poorly suited to grapple with 
the current diversity. In addition, studies of the mental health of nonimmi- 
grant racial-ethnic groups need to distinguish between different US genera- 
tions and to identify the effects of new interracial-interethnic formations; 
categories of "race-ethnicity" as units of analysis need to be conceptualized as 
social processes rather than as fixed, immutable, purely ascriptive categories. 

€ Longitudinal studies are needed to characterize and investigate the stress 
process and its temporal patterning among ethnic minority groups, including 
patterns of migrant or immigrant adaptation to specific conditions of life 
change and their psychological or emotional sequelae over time (Pearlin et al 
1981). 

€ Research is needed to identify protective factors that appear to decrease 
the prospect of suffering from serious mental health problems, or from related 
manifestations such as alcohol or drug abuse, within diverse ethnic minority 
groups. In particular, recent findings that certain immigrant groups exhibit 
lower levels of psychiatric symptoms than do majority group natives thus 
present researchers with a problem in search of an explanation. 

€ Studies of acculturation need to carefully distinguish cultural orientation 
from the assessment of personal disorganization and psychological distress 
that accompanies the acculturation process. Beyond acquiring a new language 
and new cultural information and behavior, what is it about the acculturation 
process that is distressing? Work needs to be done to clarify theoretical issues, 
specify constructs and models, and operationalize valid measures. 

@ Research is needed to investigate the mental health consequences of 
racism and racial discrimination. This type of research has two prongs: social 
costs and personal costs. The former focuses on how life chances affect 
psychiatric morbidity at a societal level, and the latter focuses on the physical 
and psychological ramifications at an individual level. 

€ Future research should explore the meaning of psychiatric signs, symp- 
toms, and dysfunctions within diverse ethnic minority communities. For 
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example, what is the social role of the mentally disordered person in a specific 
minority culture, and how does this contrast with majority grcup patterns? 
Who takes care of the mentally disordered person, and how do people respond 
to him/her? What are cultural beliefs about mental illness etiology and the 
framework for interpreting normalcy ari deviance among ethnic group 
members? Is mental illness considered an immutable condition? What is the 
language used to describe emic disorders, and how are levels of severity or 
dysfunction distinguished? 

Although there is still a significant gap :n knowledge for addressing many 
fundamental issues in ethnic minority mertal health, future research will be 
advantaged by the availability of sophisticated research designs and statistical 
techniques that were unavailable two decades ago. Therefore, despite a slow 
start, the development of knowledge in the next decade will likely far 
outdistance the total of what has been learred to date about the mental health 


of ethnic minorities. 
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Abstract 


Our purpose is to review the enormous range of ethnomethodological research 
from the past three decades. Periodically, scholars have produced review 
articles, monographs, and position papers that usually promote or critique the 
work of a particular ethnomethodological subfield. Also, textbook and other 
accounts of ethnomethodology sometimes impose a homogeneity on the field 
that neglects the various theoretical and methodological strands. We attempt 
to articulate the diversity each of the subfields represents, to clarify dis- 
tinctions between them, and to demonstrate assumptions they share. The areas 
we discuss include theory, phenomenology, cognition, conversation analysis, 
research in institutional settings, studies of science, and applied research. 
While debates about proper topics and methods of research will no doubt 
continue, underneath such debates are a shared orientation to an extant, 
achieved orderliness in everyday activities and a commitment to discovering 
organizational features of direct interaction. 


INTRODUCTION 


From the time that Harold Garfinkel's pioneering Studies in Ethnomethodolo- 
gy was published in 1967, the enterprise it started has been steeped in both 
external and internal controversy. Despite external criticisms and partly 
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through internal contentiousness, however, ethnomethodological scholars 
have produced a substantial body of work over the past 25 years. In fact, 
ethnomethodology has spawned a variety of distinctive subfields, and it is 
more accurate to say that there are several bodies of work, rather than a single 
enterprise. Years ago, for instance, Zimmerman (1978:6) noted "the increas- 
ing diversity among etbnomethodologists, with respect to choice of both 
problem and method.” However, commentators often still treat ethnomethod- 
ology as a unitary perspective; in review articles and monographs, they may 
promote or critique the work of a particular ethnomethodological school or 
subfield. In an Annual Review article three years ago, Atkinson (1988:459) 
aptly remarked that ethnomethodology “is not a homogenous field"; he 
himself, however, mainly described and critiqued just two subfields—the 
recent studies of work and conversation analysis. 

Previous reviews have thus been limited by design, whereas our purpose is 
to articulate, to the extent possible in a limited space, the range and variety of 
ethnomethodological subfields, to clarify distinctions between them, and to 
identify fundamental assumptions they share. We do this by showing how, in 
addressing a range of theoretical and methodological questions, different 
areas of inquiry take up discrete ethnomethodological themes. These include 
considering whether the proper topic of investigation is perception, cognition, 
talk, or embodied behavior; how to conceptualize the relation between in- 
teraction and social structure; and what role ethnography should play in the 
study of situated activities. 


ETHNOMETHODOLOGY AND THEORY 


Although in the preface to Studies in Ethnomethodology, Garfinkel (1967:ix) 
acknowledges his intellectual debt to Talcott Parsons (and Alfred Schutz, 
Aron Gurwitsch, and Edmund Husserl), the exact placement of the enterprise 
with regard to general social theory has remained elusive over the years. This 
may be why theory textbooks and other secondary sources tend to pigeonhole 
ethnomethodology in three characteristics ways.! First, they see ethnometh- 
odology, concerned with how people construct meaning or "definitions of the 
situation," as a version of symbolic interaction. Second, because definitions 
of the situation emerge from how persons announce and impart sense-making 
perceptions and perspectives to one another, ethnomethodology is said to be 
individualistic. Third, ethnomethodology is understood to have emerged as a 
critique of traditional ways of doing sociology. In Alexander's (1987) terms, 
it was a "reaction" against functionalist sociology that soon assumed a 
"rebellious and even revolutionary thrust." 


!We reviewed several theory textbooks to discern these patterns. These include Alexander 
(1987), Collins & Mankowski (1972), Johnson (1981), Ritzer (1983), Turner (1986), and 
Wallace & Wolf 1980. 
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Two recent theoretical reflections on ethnomethodology (Garfinkel 1988, 
Heritage 1984: Ch. 2) should lay these misconceptions to rest. These contri- 
butions make it abundantly clear that while ethnomethodology stands as a 
contrasting and alternative sociology to that of action theory, its subject 
matter is partly defined in relation to the Parsonian distinction between 
concrete actions and the analytic apparatus through which such actions are 
hypothetically ordered and explained. Rather than examining those in- 
trinsically organized particular actions and courses of action that comprise the 
experience of everyday life, Parsonian theorizing addresses a prespecified 
analytical construct—the unit act (Heritage 1984:20-21, Schegloff 
1980:151). In Parsons’ (1937) view, that is, theory at the most basic level 
employs a descriptive frame of reference to order the raw stream of experi- 
ence. Then the theorist can make empirical generalizations and derive an- 
alytical laws to explain phenomena that description has isolated. Thus, for the 
analyst, raw experience has no organization until it is apprehended in- 
terpretively through a stipulated frame of reference. 

Ethnomethodology’s “incommensurate” (Garfinkel 1988) theoretical pro- 
posal is that there is a self-generating order in concrete activities, an order 
whose scientific appreciation depends upon neither prior description, nor 
empirical generalization, nor formal specification of variable elements and 
their analytic relations. From an ethnomethodological standpoint, “raw” ex- 
perience is anything but chaotic, for the concrete activities of which it is 
composed are coeval with an intelligible organization that actors “already” 
provide and that is therefore available for scientific analysis. Central to the 
achievement of this organization are practical activities through which actors 
produce and recognize the circumstances in which they are embedded. The 
principle aim of ethnomethodology is to investigate the procedural accom- 
plishment of these activities as actual, concerted behaviors. Plainly, this is not 
a critique, reaction, or rebellion against other forms of social theory, but 
rather a positive respecification of how investigators might approach sociolo- 
gy’s most awesome phenomenon—the objective, immortal reality of social 
facts (Garfinkel 1988). Sociologists can rigorously explicate that phenomenon 
as an accomplishment of actors’ concerted work in making social facts 
observable and accountable to one another in their everyday lives. 

This is, in a nutshell, the heart of the ethnomethodological enterprise. If the 
substantive concern of ethnomethodology is the achieved intelligibility and 
organization of everyday activities, it can be appreciated that social order does 
not come about because individual actors bring their own cognitive definitions 
of the situation into some kind of convergence or common agreement. The 
focus in ethnomethodology on what are called, interchangeably, “‘pro- 
cedures,” “methods,” and “practices” runs contrary to a cognitive-interpretive 
solution to the problem of order, wherein actors produce patterned courses of 
action because they share internalized frames of reference and value systems 
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that enable common definitions of situations. Moreover, these procedures do 
not represent the solitary resources that singular souls impose upon one 
another; they are systemic resources that members of society concertedly 
enact. Thus, ethnomethodology avoids inferences about how otherwise sepa- 
rated actors abstractly think and negotiate definitions for joint projects and 
instead investigates how members are from the outset embedded in con- 
tingently accomplished structures of social action consonant with their acting 
and reacting to one another in real time. 

As theory, then, ethnomethodology proposes an extant orderliness in con- 
crete activities that is impervious to formal analytic derivation and that 
therefore requires a different form of analysis. Because of its unique stance, 
this subfield has come to occupy its own place in social theory (Boden 
1990a). Moreover, some ethnomethodological theorists (e.g. Hilbert 
forthcoming, Maynard & Wilson 1980, Rawls 1989a, b, Wilson 1982, 
forthcoming; Wilson & Zimmerman 1980), have demonstrated distinctive 
solutions to old problems, thereby forging linkages between ethnomethodolo- 
gy and the classical writings of Durkheim, Weber, Mannheim, and Marx. 
Others develop linkages with, or may critique contemporary theorists such as 
Bourdieu on “habitus” (Ostrow 1981), Collins on “interaction ritual chains" 
(Hilbert 1990), Giddens on “agency” (Boden, 1990a, Wilson 1990), Goffman 
on the “interaction order" (Rawls 1987), and Habermas on “communicative 
action" (Bogen 1989). 


PHENOMENOLOGY AND ETHNOMETHODOLOGY 


Ethnomethodological proposals about indigenous social order represent a 
common backdrop to conceptual and empirical developments for studying the 
organization of everyday life. However, the investigations that Garfinkel's 
corpus spawned are extremely varied and take independent directions that 
depend on which aspects of the work serve as a springboard for further 
inquiry. Thus, whereas numerous commentators have equated ethno- 
methodology with phenomenology (e.g. Best 1975, Rogers 1983), it is more 
accurate to say that a phenomenological sensibility is expressed in 
ethnomethodology than that the latter is or should be a phenomenological 
sociology (cf. Anderson et al 1985, Heap & Roth 1973:363-65, Garfinkel in 
Hinkle 1977:9—15). Here, for purposes of tracing this sensibility, we can give 
only the briefest exposition of phenomenology. At a very basic level, 
phenomenology helped open inquiry into the world of everyday life as a 
"universe of significance" and a "texture of meaning," to use Schutz's 
(1962:10) words. Beyond this, we suggest that the phenomenological 
sensibility expressed in specific ethnomethodological studies has three 
aspects. 
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Gestalt-Contextures and Embodied Activity 

The term phenomenology became a technical one within the history of 
philosophy with Hegel’s Phenomenology of Mind, but Edmund Husserl’s 
investigations (e.g. 1960, 1970) have infused the expression with the philo- 
sophical meanings it manifests today. Recognizing that science presents itself 
as a self-contained enterprise, Husserl argued that its objects and achieve- 
ments rest upon a vast prescientific foundation of unclarified presuppositions, 
which Husserl termed the Lebenswelt or life-world. He further asserted that 
the job of philosophy is to discern and elucidate these presuppositions in a 
progressive way until the ultimate grounds of knowledge are reached (Kockel- 
mans 1967:25-27, Gurwitsch 1964:158, Gurwitsch 1966). This concern with 
presuppositions has meant according consciousness a privileged status for 
phenomenological investigations, because it is, in Gurwitsch's (1964:159) 
words, "the universal medium of the presentation of objects." However, 
where traditional investigations regard consciousness as the place where 
external stimuli imprint coherent sensations in the mind of actors, phenomen- 
ology performs a "suspension" or "bracketing" of belief in independent 
objects as the source of perceptual experience. It is not that the reality of 
objects and the world is denied; simply, the strategy is to see how members 
experience the world and all its manifestations as real through identifiable acts 
of consciousness. Central to this process is intentionality, which, in 
phenomenology, is neither a motivational nor psychological backdrop to 
consciousness (Heap & Roth 1973:355—56). Instead, it points to acts of 
perception that refer to one another and to some connected whole, where that 
apperceived whole simultaneously makes sense of those singular acts of 
which it is comprised (Merleau Ponty 1962:7—8). One aspect of the phe- 
nomenological sensibility in ethnomethodology, then, is this notion of the 
perceptual field as intentionally assembled in the manner of a gestalt con- 
texture (cf Gurwitsch 1964) or what Zimmerman & Pollner (1970) have called 
an occasioned corpus of features to a social setting. 

The explication of local scenes of social action as assembled contextures is 
present in the work of Husserl’s student, Alfred Schutz, whose writings had a 
large bearing on Garfinkel’s early work especially. Schutz (1962) compared 
gestalt contextures to phenomenological concepts such as “noema,” which is 
a perceived intentional object that carries implications through its “inner” and 
"outer" horizons for how each detail of the object achieves its coherence in 
relationship to that very object as it is formed within the stream of experience. 
The most transparent connection between phenomenological conceptions of 
gestalt contextures or noematic relations and ethnomethodological orienta- 
tions is in Garfinkel’s (1967) well-known discussion of “the documentary 
method of interpretation.” By way of the documentary method, members 
consult presumed, institutionalized features of the society as an “underlying 
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pattern" to make sense of particular behav.oral displays, and also employ 
these displays, in turn, as evidence that the infinitely elaborate normative 
features of the collectivity are in place. The dea that social settings and their 
constituent elements are contextually coconstitutive by virtue of mem- 
bers' methodic sense-making activities is a clear debt to phenomenological 
themes. f 

While the phenomenological orientation to contextual embeddedness is 
important, Garfinkel nevertheless deemphasized Schutzian regard for per- 
ceptual knowledge as a mental process or ectivity, because of an emerging 
concern for “embodied” activity and the practical production of social facts 
that emerges in the very details of talk and action as endlessly contingent 
manifestations of real-worldly conduct. As Anderson et al (1985) have 
argued, Garfinkel’s interest in Schutz was methodological in the sense of 
using the Schutzian corpus as teachings about how to reduce theoretical 
categories and objects to fundamental constitutive activities. This movement 
away from the phenomenological accent o1 consciousness has been noted 
before (Heap & Roth 1973:364, Heritage 1984:71, 102, 110, Livingston 
1987, Psathas 1980, Wieder 1977), but it is not like behaviorism, which 
denies the existence of consciousness; it & a rejection of the mind-body 
dichotomy in an effort to make sociolog.cal analysis answerable to the 
corporeity of so-called subjective behavior. Ju Merleau-Ponty’s (1962) terms, 
the body lives things, objects, and features o? the world in general before they 
can be conceived. 


The Relationship of Rules to Human Conduct 


Other than the ethnomethodological studies of work (discussed below), the 
research of Wieder (1974) and Pollner (1987) has extended the phenomeno- 
logical sensibility in ethnomethodology most explicitly. Often quoted in the 
secondary literature is Wieder's (1974) study of a half-way house (Handel 
1982:92—95, Heritage 1984:198—209, Leiter 1980:194—200, Mehan & Wood 
1975:137—42). A key point of the study is a ong-standing ethnomethodologi- 
cal concern: the relation of rules to conduc’. Early on, ethnomethodologists 
(e.g. Zimmerman 1970) argued that the conventional sociological preoccupa- 
tion with rules (norms, roles, etc) as exolanations for patterned human 
conduct is misguided. Instead, rules can be t-eated as topics and as features of 
the very settings they are taken to organize. 

With reference to the “convict code," a set of maxims specifving proper 
conduct for residents of the half-way house, Wieder demonstrates this princi- 
ple elegantly. In a traditional approach, the code could be used to explain why 
residents repeatedly engaged in behaviors taat undermined the rehabilitative 
goals of the house. However, Wieder transforms this explanatory approach 
wherein rules are exogenous and context-free determinants of behavior by 
reembedding the code as a lived feature of ihe settings in which it is “told.” 
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That is, the emphasis is on how people use codes, rules, and roles (Halkowski 
1990b, Hilbert 1981, Maynard 1985:21—23, Watson, 1978) for various ends. 
Thus, a maxim such as "don't snitch," when a resident actually refers to it, 
may be a way of showing solidarity with other residents, or it may exhibit 
defiance against the staff. Staff, for their part, would use the code to exculpate 
themselves for their own lack of knowledge regarding resident conduct, for 
justifying changes in the program, and for defending themselves against 
complaints from the residents. In Wieder's (1974:175) terms, the code is a 
"method of moral persuasion and justification," operating via the documen- 
tary method of interpretation, to provide members with a sense of pattern and 
of a constrained order in their everyday lives (see also Zimmerman & Pollner 
1970), quite independently of any underlying moral orientation to it. The 
phenomenological sensibility thus has a second aspect: A sociological pre- 
occupation with "rule-governed" behavior can be supplanted with an un- 
derstanding of rule-usage as an aspect of members’ repertoire of practical 
actions. 


The Accomplished Objectivity of the World 


In Pollner's (1987) work, the phenomenological sensibility is expressed as an 
appreciation of the grounds upon which both everyday and sociological 
discourse depend. By adhering to a set of "idealizations," actors maintain an 
assumption that "reality," which awaits discovery and grasp, is something 
objective and exterior to themselves. While continuously enacting this 
assumption, and as a condition of making the objective world possible, actors 
perpetually hide their own idealizing work. This is not to say that people 
create reality; it is to suggest that actors are participants in the very forms and 
practices that render the world as having its definitive features. 

The idealizations that Pollner (1987:ch. 2) discusses include anticipations 
of (a) object determinacy—i.e. that the details and pieces of some confronted 
scene are but partial developments of matters that can be fully explicated, (b) 
object coherency—that these details and pieces will come together in an 
intelligible whole, (c) noncontradictoriness—i.e. that these details and pieces 
are congruent and harmonious with one another and are consistent with the 
overall whole of which they are a part. A "crisis" (Pollner 1987:53) of 
mundane reason occurs when contradictory accounts of some real-world event 
emerge. Nevertheless, mundane reason always provides a solution, such as 
finding fault with the subjectivity or perception of one or more participants in 
the crisis, so that the sense of an external and objectively knowable world is 
preserved. 

There are Jessons here not only for ethnomethodological investigations of 
everyday settings, such as traffic court that Pollner (1987) investigates. In the 
fact, members of scientific settings rely on similar idealizations and practices 
for constituting objects of inquiry. Thus, in a series of demonstrations, 
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McHoul (1982:Chapter 4) confirms how misguided it would be to search for a 
presuppositionless inquiry in sociology or any other science. Rather, the 
strategy for ethnomethodological inquiry is to analyze mundane reason in 
such a way as to understand its organizational contribution to inquiries of 
every kind, even those involving itself. Th s requires, according to Poliner 
(1991), a radical reflexivity, a willingness to confront primordial pre- 
suppositions, including those which permit cne's own sustained investigation 
in the first place. 

Heretofore, that sort of radical inquiry is what guided the phenomenologi- 
cal sensibility and aided the discoveries we have itemized above. That is, at 
least three aspects of this sensibility remain as an intrinsic part of ethnometh- 
odological work. These include, firstly, the idea that the experienced social 
world is composed not of discrete "variables" of one sort or another but of 
gestalt contextures that are assembled in and through actors' intrinsic ordering 
activities. This intrinsic ordering activity includes the lived way in which 
percipient bodies initially bring the world into being and only secondarily 
conceptualize it. Secondly, the ordering of the world does not occur through 
following rules or roles or other abstractly formulated proscriptions. Such 
proscriptions are themselves usable resources for “doing” nameable activities 
and providing for a visible, sensible social environment. Finally, the experi- 
ence of an objective world, whether in everyday or scientific settings, de- 
pends upon practical adherence to a set of idealizations or presuppositions that 
require a radical investigative stance for proper inquiry. 


ETHNOMETHODOLOGY AND COGNITION 


Because of early ethnomethodological interest in phenomenology and its 
emphasis on consciousness, the social organ zation of cognition would seem a 
natural topic of inquiry. Indeed, from at lezst two sources, Garfinkel (1967) 
developed a concern for “commonsense knowledge of social structure." He 
followed Kaufmann (1944) in distinguishimg normative rules of social life 
(mores, folkways) from those fundamental or “basic” rules that enable actors 
to identify concrete behavioral displays as specific actions. For instance, by 
way of the basic rules and their “constitutive accent,” participants can see a 
mark on a piece of paper, orient to a regulat on such as "place your X or your 
O within the square," and derive an awareness of playing tic-tac-toe. As 
Garfinkel (1963:195) states it, "the basic rules provide a behavior's sense as 


?The constitutive accent refers to expectancies (a; that the basic rules are to be followed 
regardless of a party's purposes, desires, and circumstances, (5) that the rules are binding upon 
the other party as well as oneself, and (c) that the other party holds the same expectations as does 
ego. 
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an action." When some behavioral display (putting an X on the line) breaches 
the basic rules, it is not like the violation of a norm, wherein a player can be 
seen as deviant within the order of the game. Rather, such a breach violates 
the game as a meaningful order of activities, causes confusion, and invites an 
altered sense of what game is underway. Then, Garfinkel (1963:210-14) 
observed that the basic rules and their constitutive accent are comparable to 
the components of what Schutz (1962) calls the attitude of daily life, a set of 
assumptions or presuppositions whereby participants achieve the "perceived 
normality” of their ordinary social environments (Garfinkel 1963:210—14). 


Cognitive Sociology 


Given the importance of assumptions and presuppositions, and thus the 
references to the subjectivity of actors, it would seem relevant to develop 
further suggestions about the role of cognition in social life. The early work of 
Cicourel (19742), who developed the metaphor of “basic rules" by referring to 
interpretive procedures and comparing them with "surface" or normative 
rules, has this character. Interpretive procedures enable the actor to "assign 
meaning or relevance to an environment of objects" (Cicourel 1974a:30) and 
to decide what norms and roles are operative in a given social situation. 
Employing Schutzian (1964) categories as well, Cicourel (1974a:34—36) 
formulated interpretive procedures in terms of the component assumptions of 
the attitude of daily life, which include idealizations to the effect that a 
speaker and hearer can change places and nonetheless experience social 
objects in a common way, that the speaker's and hearer's biographies will not 
interfere with the assignment of meaning to everyday activities, and so on. 
Furthermore, Cicourel drew an analogy between the assumptions of the 
attitude of daily life (as interpretive procedures) and the "deep structure" of 
transformational grammar (Chomsky 1968). 

Cicourel’s synthesis of Schutzian and Chomskyian concepts and metaphors 
resulted in an emphasis on interpretive procedures that are essentially minded 
or mental phenomena. The cognitive accent also derives from Cicourel's 
(1981:90) reading of the philosophy of language, particularly speech act 
theory as articulated by Austin (1962), Grice (1975), and Searle (1969). 
Thus, Cicourel (1981) makes reference to "information processing," "mental 
processing," and "screening" activities whereby actors "match" or "link" 
prosodic and paralinguistic speech clues with background expectations and 
social structural knowledge to interpret what a speaker intends with a giveri 
utterance. Furthermore, drawing on Rumelhart (1977), Cicourel (1981:98— 
102) argues that sociologists, to fully handle the problem of meaning in 
everyday life, need "schema theory" and "cognitive linguistic concepts" that 
deal with information storage, organization, and retrieval. The main analytic 
topics in Cicourel's oeuvre include typifications, such as those employed in 
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the processing of juvenile delinquents (Cicourel 1968); commonsense knowl- 
edge, as utilized by children, for instance, to arrive at both "right" and 
"wrong" answers on intelligence tests (Cicourel et al 1974); processes of 
memory, such as “selective attention,” “filtering,” and “abductive reasoning" 
in the conduct of medical interviews (Cicourel 1974b), and related matters. 


Epistemic Sociology 


Early on, the work of Cicourel seemed almost definitive of ethnomethodolog- 
ical approaches to cognition. That is, commentators often cited his work as 
exemplary. While the tie to early ethnomethodological preoccupations with 
basic rules, commonsense knowledge, and the like is clear, more recent 
ethnomethodology departs from the mentalistic emphasis. The divergence 
derives partly from the influence of another stream of thought in the philoso- 
phy of language, particularly Ludwig Wittgenstein’s examination of language 
practices—how actors employ words and sentences in concrete situations. In 
Philosophical Investigations and other posthimous publications, Wittgenstein 
(1958) argues that language—rather than being a vehicle for naming things, 
or conveying information, or even enacting intentions according to rules of 
(what is now called) speech act usage—is ar. “activity” or “form of life” in its 
own right. As an example, to analyze a single word in the language—such as 
“description”—and propose that there is a single definable class of phe- 
nomena to which it refers is to neglect that descriptions can be a wide variety 
of things depending on the various roles the word plays in a multiplicity of 
“language games" (Wittgenstein 1958:para. 24). To discover the meaning of a 
word, it is not possible to rely on ostens ve definitions; it is a matter of 
examining contexts of use to discover similarities that Wittgenstein 
(1958:para 67) called “family resemblances.” It is in the actual practice of 
employing words that such resemblances can be traced and the lexical and 
other components of language appreciated as a form of life. 

One form of life in which people participate is making attributions about 
their own and others’ cognitive states. Suck attributions have been subject to 
Coulter’s (1989:16) “sociology of cognitior” or “epistemic” sociology, a set 
of inquiries concerned with the internal log or forms of orderliness implicit 
in actors’ own assessments of subjectivity. Drawing directly from Wittgen- 
stein's (1958, para. 373) arguments, Cou ter (1989:50—51) refers to these 
forms as grammars of cognition, by which he means the linguistic 
frameworks, circumstances, and systems within which cognitive concepts and 
expressions are appropriately applied or used in everyday life. 

Examining grammars of cognition implies avoiding what Ryle (1949:16) 
has called the “dogma of the ghost in the machine,” which perpetuates the 
dualisms of personality theory to the effect that body and mind are separate 
entities, one being public, the other private, one external, the other internal, 


THE DIVERSITY OF ETHNOMETHODOLOGY 395 


one physical, the other mental. Like Wittgenstein, Ryle took on an array of 
cognitive and psychological concepts, debunking the assumption that they 
have essentially subjective referents. These concepts include willfulness, 
emotions of various kinds (feelings, agitations, moods, wants, etc), self, 
self-knowledge, sensation, imagination, and intellect. Consistent with both 
Wittgenstein & Ryle, then, Coulter (1989:58) argues that all of these ghosts 
"are, in fact, mundanely and routinely avowed, ascribed and observably 
presupposed in practical social life." 

What this means, empirically, is explored most fully in Coulter's (19792) 
The Social Construction of Mind. As one example, consider the phenomenon 
of presupposition, a seeming consummate aspect of individual subjectivity. 
Coulter (1979a:52) suggests that actual, spoken. utterances are sometimes 
treated in situ in a way that shows what they might conventionally pre- 
suppose. Here is an example from a brief argument between two children 
(Maynard 1985:15): 


1. Tom: Darn it 

2. (0.2 seconds silence) 

3. Mike: .hhh hhh! You said darn it Tom, Tom you said a naughty word 
4. Tom: That's not a naughty word ^ 


At line 3, Mike notices Tom's line 1 expletive by producing an inbreath/ 
outbreath gasp and then reporting on what Tom has said. The basis of this 
noticeability and reportability is supplied in Mike's next utterance, where he 
characterizes the words as "naughty." Now Jefferson (1974:192) has noted 
that parties distribute items of talk, such as expletives, in terms of appropriate 
environments. Thus, Tom's contradiction of the "naughty word" characteriza- 
tion (line 4) exhibits explicitly what he may have presumed in saying the 
expletive, that it was not improper or offensive in this situation. Mike's line 3 
opposition disallows that presumption and in that way makes it observable. 
As Coulter (1979b:174) remarks, “presuppositions are usable as a hearer's 
argumentative device," and such usages are both socially organized and 
public in character. 

Another apparently quintessential psychic entity Coulter (1979a:59) points 
to is memory. An excellent example of how a grammar of cognition can 
elucidate the social organization of recollection and forgetting is Goodwin's 
(1987) article on "Forgetfulness as an interactive resource," which shows 
how, in a multiparty. setting, a husband's asking his wife about a third party's 
name works to modify the interactants' participation framework. Briefly, the 
"display of uncertainty" or forgetfulness occurs at a moment when the wife 
has begun an utterance that is competing with her husband's, and the display 
solicits her attention in such a way as to dislodge her from her own line of 
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talk. Thus, his “forgetfulness” is analyzable as a public, behavioral, socially 
organized matter with discrete interactional aims and consequences. Sim- 
ilarly, Drew (1989) shows how a display of forgetfulness can be the product 
of a socially organized "testing" scenario, or one in which a speaker manages 
to "expose" something about a recipient and hence the recipient’s com- 
petence, relationship with the speaker, and other matters. 

Ethnomethodology, then, has promoted interest in the sociology of cogni- 
tion, but in at least two very different ways. Drawing from Schutz, Chomsky, 
and ordinary language philosophers who lean toward a concern with internal 
mental structures that underlie and make discourse possible, Cicourel's ver- 
sion is traditional in the sense of remaining with the ghosts in the machine— 
the contents of attention, memory, knowledge, typifications—and how they 
influence interaction. Coulter, in comparison, is indebted to Ryle, Wittgen- 
stein, and others who rehabilitate the ghosts in the machine as part of the 
vocabulary, practices and routines—i.e. social organization—through which 
actors avow, display, and ascribe various subjective qualities to one another 
in everyday life. 


CONVERSATION ANALYSIS 


Conversation analysis, a field originating in Harvey Sacks' collaborative 
inquiries with Emanuel Schegloff and Gail Jefferson, has emerged as perhaps 
the most visible and influential form of ethnomethodological research. A 
number of edited books (Atkinson & Heritage 1984, Button & Lee 1987, 
Psathas 1979, 1990, Schenkein 1978, Sudnow 1972) and special issues of 
journals (Beach 1989, Button et al 1986, Maynard 1987, 1988, West & 
Zimmerman 1980) have featured conversation analytic works. Moreover, as 
Heritage (1984:233) notes, conversation analysis independently influenced 
disciplines cognate to sociology, including anthropology, communications, 
linguistics, social psychology, and cognitive science. Because a number of 
excellent secondary accounts of conversation analysis exist (Atkinson & 
Heritage 1984, Boden 1990b, Goodwin & Heritage 1990, Heritage 1984: 
Chapter 8, Lee 1987, Levinson 1983: Chapter 6, Zimmerman 1988, Whalen, 
forthcoming), we only touch upon topics that are well summarized in this 
literature. At the most general level, conversation analysis is concerned with 
the methodical construction in and through talk of member-productive and 
analyzable social action and activity. Exemplary topics include the organiza- 
tion of sequences (as perhaps the identifying mark of most conversation 
analytic studies), the structure of repair, the "preference" for certain con- 
versational forms (agreement, for example) as an organizational rather than 
psychological phenomenon (Pomerantz 1984), the primacy of ordinary con- 
versation and its relation to social structure and institutional environments 
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(Boden & Zimmerman, forthcoming, Clayman 1989, Drew & Heritage 
1991), the interrelation of vocal and nonvocal activities (M. Goodwin 1980, 
C. Goodwin 1981, Heath 1986) and others. Moreover, research in this area 
has been cumulative, with early studies providing a basis for the exploration 
and analysis of concerted activity in new domains and for the explication of 

"single episodes" of interaction (Clayman & Whalen, 1988-1989, Schegloff 
1987a, 1988-1989, Whalen et al 1988). 

As this volume of work, its cumulative character, and its influence on other 
areas attest, conversation analysis is an endeavor with a momentum and 
trajectory separate from ethnomethodology (Schegloff 1989). Other in- 
fluences on Sacks include, most prominently, Erving Goffman, with whom 
Sacks studied at Berkeley, and a wide range of nonsociological intellectual 
developments, which include Wittgenstein’s ordinary language philosophy, 
Chomskian generative grammar, Freudian psychology, anthropological field 
work, and so on (Schegloff, 1988, 1989). Still, the relationship between 
Garfinkel and Sacks was clearly crucial to conversation analysis, and bonds 
between the two areas run deep. Secondary accounts often suggest that 
conversation analysis emerged as a “kind” or “form” of ethnomethodology 
but fail to explicate precisely how it emerged and what specific form it entails, 
although ten Have’s (1990) thorough review suggests that conversation analy- 
sis is a strategy or method for dealing with the “invisibility of commonsense.” 
In the interest of explaining the diversity of ethnomethodology, we wish to 
make two other observations about its relationship with conversation analysis. 
One observation deals with indexical expressions and the other concerns a 
contrast between conversational procedure and talk as an embodied activity. 


Indexical Expressions and Sequential Organization 


A natural place to start is with a collaborative article by Garfinkel & Sacks 
(1970), entitled “On formal structures of practical actions," which is con- 
cerned with the phenomenon of natural language. All natural language is 
indexical, which means that the understandability of any utterance, rather 
than being fixed by some abstract definition, depends upon the circumstances 
in which it appears. Even the verbal products of "practical sociological 
reasoning," wherein laypersons or professionals theorize about society, are 
indexical in character, despite the best efforts to render these propositional 
phrases as objective, context-free, and universal as possible. Hypotheses, 
ideal types, interview schedules, coding formats, and so on, all show the 
effort to employ abstract expressions that will prevail over the contingencies 
inherent in the situations where such expressions are meant to apply. Yet we 
know that they never do cover all the exigencies of interaction in particular 
settings; our best laid descriptions, categories, and explanations always leave 
something out, need fudging, or are replete with inconsistencies. Moreover, 
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because any linguistic usage is invariably irclexical, the effort to remedy the 
circumstantiality of one statement by producing a more exact rendition will 
preserve that very feature in the attempt. Thus, there is no time out from the 
phenomenon. Instead of approaching indexical expressions as a methodolog- 
ical nuisance (in the way that conventional social theory and research often 
does), therefore, an alternative is to exam ne their constitutive properties. 
This is the tack of Garfinkel & Sacks (1970:341), who argue that the 
properties of indexical expressions are ordered and socially organized; such 
orderliness, moreover, “is an ongoing, prectical accomplishment of every 
actual occasion of commonplace speech ard conduct.” Indeed, rather than 
being a problem in need of remedy, indexical expressions can be a resource 
for broadly social ends. 

This, then, raises a question as to whet social activities comprise the 
orderliness of indexical expressions. As one instance, Garfinkel & Sacks 
(1970) discuss “formulations” through which members describe, explain, 
characterize, summarize, or otherwise "say- n-so-many-words” what they are 
doing or talking about. The use of formulations is a socially organized one in 
that, for instance, formulations may arise at a point where the "determinate" 
gist of a potentially multifaceted conversation has become problematic, and 
they regularly invite confirmation or denial (Heritage & Watson 1979). As 
another instance of the orderliness to indexical expressions, Garfinkel & 
Sacks (1970) discuss “glossing practices" and a collection of examples. One 
of these is “a definition used in first approximation." At the beginning of an 
article, an author may offer a loose definiion of some term, subsequently 
developing arguments and exhibits to support the definition. At the end, the 
author will supply a second and more prec.se definition of the term, which 
formulates the features and connections among the exhibits, arguments, and 
definitions (Garfinkel & Sacks 1970:364). 

Neither formulations nor glosses, which are themselves indexical, provide 
the essential means for rendering conversational activities intelligible, howev- 
er. Sacks (1967, lecture 11: 10) takes up thiz very problem in his unpublished 
lectures: 


If . . . somebody produced an utterance and you fizured that a way you would show that 
you understood it was to produce an explication cf it, then that explication would pose 
exactly the task that the initial utterance posed. Ard you could see how rapidly it would 
become an impossible procedure. And in any evenr would involve some sorts of constant 
and possible indefinitely extended "times outs" in a conversation. 


While it is impossible to accomplish mutual understanding by explicating 
utterances i.e. through formulations or glosses, Sacks argues that the mech- 


?The previously-unpublished lectures of Sacks are making their way into print. Early lectures 
(1964—1965) are in Sacks (1989), and the full corpu. is being published as Sacks (1990). 


THE DIVERSITY OF ETHNOMETHODOLOGY 399 


anism of tying one utterance to another through “pro-terms” is an economical 
way of achieving intelligibility (cf Watson 1987). Pronouns, which, in any 
given utterance, may refer to some other noun or category on whose behalf 
they stand, are characteristic tying devices, as are what Sacks (1967, #11:1) 
calls "pro-verbs": 


So, for example, in a sequence like: “I got injured a bit, too,” “You did?", then “You did" 
is tied to the prior utterance. 


Notice that the relation between the pro-verb (“did”) and its reference (“got 
injured") partially depends upon the contiguity of the two utterances. From 
observations like these, it is not too far distant to consider what is perhaps "the 
most powerful device for relating utterances” (Sacks, Spring 1972, #4:1)— 
the adjacency pair (Schegloff & Sacks 1973). Among other tasks, this device 
allows phrases and clauses (“because I wanted to,” or “to get home”), which 
otherwise are not grammatical sentences, to be comprehendible con- 
versational utterances as second pair parts or answers-to-questions (Sacks, 
Spring 1972, #1:23). 

Stated differently, the work of tying and adjacency-pairing suggests the 
overall importance of positioning utteránces for purposes of intelligibility. 
This operates in two ways. On the one hand, interactants comprehend utter- 
ances by considering them in relation to preceding talk. Correspondingly, 
they display their understanding through tying their own utterances to pre- 
vious ones (Sacks et al 1974;728-29). The devices of tying, placing, and 
positioning pieces of talk generalize to a proposition that the sequential 
features of conversation are pervasively operative in the analysis of any given 
utterance (Sacks, Fall 1967, #13:10). Therefore, if talk is indexical, as in the 
use of pro-terms, phrases, and other clauses whose meaning is contextually 
derived, it is a phenomenon accomplished as participants build utterances in a 
sequential fashion to make them intelligible and to show understanding in 
precise ways. In such fashion are the indexical properties of talk ordered. 

This accomplishment, moreover, implies a social organization that is local 
and for itself, rather than, say, operating on behalf of some externally based 
social structure, such as class, gender, or ethnicity. In that participants tie one 
utterance to another, a recipient who wishes to speak to whatever topic is on 
the floor is required to listen not just to some utterance-in-progress, but to the 
spate of previous talk, for it is in terms of this previous talk that the current 
utterance itself makes sense. Additionally, when taking a turn of talk, a 
current speaker is required to demonstrate its relationship to an immediately 
previous utterance and, via the practice of tying, to the utterances preceding it 
(Sacks, Fall 1967, #11). 

Indexical expressions are a window, then, through which to gaze upon the 
bedrock of social order. Actors achieve all manner of relationship, inter- 
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dependence, and commitment (Rawls 19892), and perform an infinite variety 
of social actions, through the precise placement of single turns of talk in 
relation to surrounding vocalizations and nomvocal gestures as well (Goodwin 
1986). Thus, one bond between conversatio1 analysis and ethnomethodology 
resides in the extensive exploration, provided by conversation analysis, into 
one profoundly ordered property of indexical expressions, that is, their se- 
quential organization. 


Conversational Procedures and Talk as an Embodied Activity 


Sacks' insight into tying and consequently into the temporal and sequential 
organization of conversation was a brilliamt one for sociology, yielding a 
fundamental understanding of social order. From this insight, we can see the 
development of the interest in turn-taking as a basic device involving sequen- 
tial organization as it is described in one of the most theoretically and 
empirically important pieces of ethnomethodologically related research 
(Sacks et al 1974). However, the analysis cf turn-taking has been subject to 
complaint for its formalism (cf Liberman 1€85, Lynch 1985: Ch. 5, Molotch 
& Boden 1985, Peyrot 1982). For instance, Livingston (1987:73) argues that 
descriptions of disengaged and objective rules for turn-taking miss the embod- 
ied work by which conversationalists exhib# and ensure their talk is account- 
ably being done turn-by-turn. 

While this might be one reading of the paper or of some of its offshoots, the 
analysis of turn-taking does show a deep corcern with turn-taking practices as 
achievements rather than as causal and disembodied rules (cf Schegloff, 
forthcoming). That is, participants enact the turn-taking system in no mechan- 
ical fashion; instead they compete, often sharply, to be able to talk, and 
respond to the moment-by-moment emergence of enormously consequential 
and diverse exigencies, including gaps, owerlaps, hesitations, false-starts, 
errors, topical trajectories, and so on. Fo- example, Goodwin (1981) has 
shown that when the speaker of a turn-at-talk notices a recipient’s gaze 
wander, that speaker will recycle the turn-so-far, effectively requesting recipi- 
ent to return gaze to speaker. What may then appear (on an abstracted 
transcript of the interaction) as some accidertal utterance restart is involved in 
the assembly of a turn-of-talk during whica the recipient will have been in 
ocular attendance to the speaker. The interaction therefore consists not just of 
some sort of "adherence" to turn-taking procedures, but of systemic moves 
wherein it is witnessable that one and just one "speaker" is talking to a 
“recipient” who is listening. Conversational turn-taking is organized not like a 
system of rules or syntax of language, but in and as the particulars and details 
comprise talk in its spoken, real-time actiality. On a moment-to-moment 
basis, turn-taking is a concerted, interactive achievement. 

This point can be extended with the juxtaposition of conversation analytic 
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and ethnomethodological investigations of a most mundane event in daily life, 
a telephone call. Schegloff's (1968) study of telephone openings revealed a 
significant property of ringing phones, namely that they are summonses, and 
as such require an answer. Together, a summons and its answer comprise an 
adjacency pair whose use helps coordinate entry into conversation. Schegloff 
(1968) arrived at this characterization through examining a corpus of 500 
telephone calls and, at first, formulating a “distribution” rule to the effect that 
"the answerer speaks first." However, one case, in which the caller spoke 
first, did not fit the pattern. This prompted a more finely-grained analysis, in 
which Schegloff (1968) proposed that a telephone ring and its answer form a 
summons-answer sequence. Furthermore, because summons-answer se- 
quences have the property of conditional relevance, when someone produces 
the first item of the sequence (a summons), the second part (an answer) is 
thereupon expectable and, if absent, it is noticeably absent and subject to 
repair. With regard to a ringing telephone, this means that if someone picks 
up the telephone but says nothing, a caller will regularly reissue a summons 
and thereby "speak first" by saying "hello." 

The conditional relevance of an "answer" to a summons is clear to anyone 
who has heard a ringing phone and has felt compelled to withdraw from some 
current engagement or activity to answer it. The sequential organization of 
first pair parts and second pair parts is very powerful, and Schegloff's (1968) 
elegant analysis demonstrates an achieved, unitary solution to the problem of 
coordinated entry that operates across a variety of settings, across vocal and 
nonvocal activities, and even across the duration of a single conversation. 
Nevertheless, there is a further orderliness to the process of summoning and 
answering than this analysis provides. That is, the study of sequences raises 
the question of how sequential components (e.g. summonses and answers) are 
recognizably constituted. For example, Schegloff (1986:118-21, cf 1968) 
himself notes that persons in the environment of a ringing phone engage in 
various analyses to determine just who should answer, and how and when. 
That is, the summons is itself a socially assembled object in terms of how it 
implicates a recipient's responsive action. Ethnomethodological demonstra- 
tion of this point is in Garfinkel’s “summoning phones” exercise. Students are 
asked to gather tape recordings of ringing telephones that are (a) hearably 
summoning just them, (b) hearably summoning someone else, (c) simulating 
hearably summoning just them, (d) simulating hearably summoning someone 
else, and (e) just ringing. A full explication of this exercise is now published; 
it demonstrates how “methodic procedures" render lived experience and its 
intrinsic orderliness into “signed” objects whose interpretation necessarily 
loses a grasp of such orderliness (Garfinkel & Wieder 1991). In this regard, 
we wish to raise only two prominent points. 

First, in making a “ringing phone” the circumstance for further action, 
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parties are engaged in primeval social, compositional activities. Thus, that a 
phone is ringing depends upon an actor foregrounding some high-pitched 
frequency from a heretofore differently constructed background that im- 
mediately has the character of silence out of which the just-now hearable 
phone-ring emerged. Moreover, that a phone is hearably summoning one or 
another or nobody in particular is the outcome of participants' activities which 
on singularly distinguishable occasions provide for just how the phone is 
ringing. This includes that it is “just another ringing phone” of which many 
have been heard and that it is nonetheless soliciting some particular some- 
one's answer, in part because of the "direction" from which the ring may be 
heard (Garfinkel & Wieder 1991). The organization of summons-answer 
sequences can be decomposed to appreciate that any comparability of a 
"ringing phone" across settings, rather than residing in the nature of phone 
rings as uniform and context-free stimuli, is the outcome of situated work as 
an embodied presence in settings of social action wherein exists an in vivo 
coherence to the phenomenal details of which the object ("ringing phone") 
originally consists.“ 

Second, as an example of other aspects of order that can be unraveled from 
an accountably summoning phone, consider that “who” a ringing phone is 
summoning depends upon how an actor, in concert with others, further forges 
the social environment in which that event occurs.? This process can include: 
(a) how one categorically orients to the environment—as one's own office or 
home, as someone else's office or home, as a public domain, and so on; (b) 
the informings that are available prior to or during the phone-ring, such as 
“Jane should be calling soon," or "That's Jane"; (c) whether one is using a 
phone and calling someone else, such that the ringing represents an "outgo- 
ing" summons and is on the other end of the line, or in a place where an inert 
phone can suddenly -commence with a bell or other type of noise that 
represents an "incoming" summons; (d) the expectations that result from 
relationships through which parties construct an understanding that the other 


^Along these lines, Schegloff (1968:1090) has said, "The activity of summoning is . . . not 
intrinsic to any of the items that compose it; it is an assembled product whose efficacious 
properties are cooperatively yielded by the interactive work of both summoner and answerer." 
See also Schegloff's (1986:118) discussion of how an apparently mechanical and standardized 
telephone ring is, in fact, a "socially and interactionally shaped product." Finally, a more 
exhaustive treatment of these matters can be found in Schegloff's (n.d.) revised dissertation 
chapters. 

?The following list of examples derives, in part, from taped and written comments of 
participants in an ethnomethodology seminar when Harold Garfinkel was a visitor in the 
Department of Sociology, University of Wisconsin, Spring 1990. Also compare this list to 
Schegloff's (1986:118—20) remarks on how the phenomenal properties of a ringing phone are 
interactionally produced. 
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will call one just here, just now—for example, "my wife's parents call every 
Thursday night about this time"; (e) the spatial positionings and activities of 
members of an office or household vis-á-vis one another and the telephone— 
for instance, the nearest to a ringing phone may be the "summoned" party, or 
the one who is not presently “working” or otherwise engaged; and (f) how 
each of the above—(a) through (e)—are themselves in situ accomplishments 
in detail. » 

In general, Garfinkel (1967) long ago offered the possibility that every 
feature of a setting is an endogenous, concerted achievement. This means that 
there is embodied work to the production of even those features that some- 
times are discussed as "rules," "resources," "first pair parts," "second pair 
parts," and the like. Turn-taking as a kind of sequential organization might be 
a "resource" for the production and understanding of indexical expressions, 
for instance, but it is important to see such organization as itself a part of the 
activity for which it is both a resource in the production and understanding of 
indexical expressions, and itself an orderly achievement. Summonses migbt 
be first-pair parts that make answers conditionally relevant and thus serve to 
initiate a conversational sequence through which participants can coordinate 
and make accountable their entry into conversation; those summonses are also 
phenomena of orderly achievement. The lesson here is that any principle, 
including sequential organization, by which it is possible analytically to order 
the phenomenal detail of every life, including talk, is a detail of order that 
recursively permits appreciation as itself an organized accomplishment. 

We have specified two possible intersections between ethnomethodology 
and conversation analysis. One intersection derives from how Sacks' con- 
cern with the tying and positioning of utterances (and ultimately with sequen- 
tial organization) relates to Garfinkel’s (1967) and Garfinkel & Sacks’ (1970) 
proposal that indexical expressions have ordered properties. Conversation 
analysis, then, is like an extended treatise on a major, socially accomplished 
and organized resource for the production and understanding of indexical 
expressions. A second intersection is recognized and adumbrated in conversa- 
tion analytic studies and relates to an ethnomethodological concern for 
"embodied production." The issue concerns how conversational objects, 
procedures, and practices, while involved in the assembly of sequences, 
speech exchange systems, and other structures of talk, can be further de- 
composed as orderly phenomena. 
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SFor a discussion of another relationship between ethnomethodology and conversation analy- 
sis, see Heritage (1984: chapters 4 and 5) and Goodwin & Heritage (1990). In terms of moral 
sanctionability, there is a connection between (a) adhering to the Schutzian (1962) "attitude of 
daily life" and general sense-making procedures, and (b) orienting to adjacency pair and other 
sequential frameworks. 
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ETHNOMETHODOLOGICAL STUDIES OF 
INSTITUTIONAL SETTINGS 


Many of Garfinkel’s initial investigations were carried out in casual arenas of 
social life rather than in the institutional settings around which so many 
sociological subdisciplines have developed. The well-known “breaching” 
experiments, in which students were asked to act like boarders in their own 
homes, or to question the taken-for-granted meanings of ordinary remarks of 
family, friends, and acquaintances, exemplify this analytic tendency. Even 
when examining clinics, juries, and the like, Garfinkel's (1967) early work 
was not so much about medicine, law, or other specialized occupational 
practices as it was concerned to portray an order of phenomena not previously 
studied within sociology: namely, the methods used by societal members to 
render their circumstances orderly and intelligible. As one example, Garfink- 
el’s (1967:Ch. 3) experiment with student counseling was designed to in- 
vestigate the properties of the documentary method of interpretation rather 
than the work of counseling. The phenomena of primary interest were those 
practices essential to the accomplishment of coherent social interchange 
wherever it occurs. 

We refer to this interest as institutionally indifferent (cf Garfinkel & Sacks 
1970), in the sense of disregarding prior theoretical and other specific charac- 
terizations upon which such studies might aim to improve. Cicourel's early 
work was similarly focused on structures of interaction per se rather than the 
specific arenas in which it occurred. Thus, empirical studies of juvenile 
justice (Cicourel 1968) may have shown how aspects of that particular system 
worked, but they also revealed the role of actors’ theories and language in 
making categories (such as "delinquent") available as a sense-making device. 
Theoretical statements about children's acquisition of social structure 
(Cicourel 1970) were not about socialization traditionally conceived, but 
about the background expectancies by which children as competent members 
of society participate in the work of sustaining normal forms of social 
interaction. Much conversation analysis has had a similar stress. As Sacks et 
al (1974) have noted, there was no special interest in particular settings. 
Rather, in developing a “naturalistic, observational discipline," they dis- 
covered conversation to have properties that were analyzably involved in 
accomplishing mundane activities of various sorts (Schegloff & Sacks 1973). 

Nevertheless, ethnomethodological inquires have also illuminated role- 
based, task-oriented, or otherwise institutional activities. Researchers have 
ventured into a wide range of bureaucratic and occupational settings—e.g. 
classrooms, courtrooms, medical clinics, police departments, public welfare 
agencies, and elsewhere—to produce findings that are institutionally sensi- 
tive. These studies explicate the processes through which participants perform 
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official tasks and in an ongoing way constitute formal systems, institutions, 
and organizations. Such institutionally sensitive research has employed both 
ethnographic and discourse analytic methods; we shall examine exemplary 
studies from each tradition in turn. 


Early Ethnographic Studies 


Using ethnographic techniques of research, many of Garfinkel’s early stu- 
dents and colleagues examined methods of practical reasoning employed by 
bureaucratic workers in the course of their official duties. A theme runs 
through studies of various institutional settings, including criminal and ju- 
venile justice (Bittner 1967a,b, Cicourel 1968), plea bargaining (Sudnow 
1965), hospital wards (Sudnow 1967), correctional facilities (Wieder 1974), 
and public welfare agencies (Zimmerman 1969, 1970). Following what we 
have called the phenomenological sensibility, they demonstrate the in- 
adequacy of formal rules and official procedures for capturing the detailed 
work that is necessary to perform competently the tasks that each setting 
poses. The most elaborately codified rules and procedures cannot ex- 
haustively enumerate the range of situations to which they should apply; they 
inevitably require judgmental work which cannot itself be specified by rule, 
for it is the foundational process through which rules of all sorts are im- 
plemented. For example, police officers must subordinate strict enforcement 
of the law to such broader concerns as keeping the peace (Bittner 1967b). 
Staff in a welfare agency follow bureaucratic intake procedures only when 
they facilitate an orderly flow of clients, and they suspend or deviate from 
such procedures if that is necessary to sustain a smoothly running office 
(Zimmerman 1970). Still, while rules and procedures fail as literal de- 
scriptions of institutional practice, they are not irrelevant to understanding 
those practices. Actors use them to make sense of their circumstances and 
invoke them as rhetorical and accounting devices in dealing with others and 
getting their jobs done. By these.means, actors may provide themselves and 
others with a sense, after all is said and done, that organizations and in- 
stitutions are “rational” concerns. 

Ethnomethodological understanding of institutional processing has further 
implications for how institutional products should be understood. First, in- 
stitutions and organizations generate official designations such as “crimes,” 
“juvenile delinquents,” “qualified welfare recipients," “dead patients,” and so 
forth. Studies reveal that such designations do not result from workers 
consulting clearly defined necessary and sufficient criteria for each case. In 
the same way that actors adapt rules to situational exigencies, they apply 
labels in relation to commonsense methods of experiencing and handling 
troubles and problems. Since many of these studies concern the work of law 
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enforcement and criminal justice, they have :mportant theoretical implications 
for the constitution of “deviance” as a social fact. The ethnomethodological 
slant on the constitution of deviance, in shewing detailed ways in which the 
"politics of experience" occurs and how categories can be ex post facto 
justifications for decisions already made, is, however, distinctive (Garfinkel 
1967:14, Mehan et al 1986:156—7, Poliner 1987:Ch. 4). 

In regard to institutional products, a second and related contribution of 
ethnomethodological studies is in the sociclogical understanding of official 
statistics. While sociologists have long recognized the problems with such 
data, ethnomethodological researchers have documented the reasons such 
statistics do not reflect some “real” rates of deviance in the society. Garfinkel 
(1967:14—18, 208-61) and others (Atkinsor 1978, Cicourel & Kitsuse 1963, 
Cicourel 1968, Douglas 1967, Meehan, 1985, 1989, Smith 1974) have shown 
how practitioners employ commonsense kncwledge of the society's workings 
for making decisions about what "counts" as an instance of the thing being 
tallied. Inevitably, then, the quantities of su.cide, or juvenile delinquency, or 
mental retardation in the society reflect tbe interpretive work of professionals 
in the field, who assemble the particulars of any case according to their 
typifications of people, motives, and situations. 


Discourse Studies 


Much work in institutional settings is discharged through the medium of 
discourse or spoken interaction. Trial examinations, plea negotiations, medi- 
cal interviews, citizen calls to the police, business meetings, and news 
interviews are just some arenas in whick talk is a ubiquitous presence. 
Inevitably, talk and interaction shape the very institutions and organizations in 
which they are embedded. Nevertheless, ethnomethodologists have different 
positions regarding how institutional discourse should be studied. One strat- 
egy emphasizes the importance of supplementary ethnographic investigations, 
including observations of organizational epvironments and interviews with 
participants (Cicourel 1981, 1987). An argument can be made that earlier 
encounters affect any given interactional episode; therefore analysis of such 
episodes should incorporate an understandina of the contexts that surround it. 
In Mehan's (1991) terms, interviewing participants to reveal their patterns of 
commonsense reasoning is a device that can show the relation of organization- 
al and other “distal” contexts to “local” or “proximal” patterns of language use 
in institutional settings. 

Conversation analysts take a different approach to institutional talk, con- 
cerned that using terms such as "doctor's office," "courtroom," "police 
department," "school room," and the like, tc characterize settings and warrant 
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contextual investigations is a vernacular practice that can obscure much of 
what occurs within those settings. In doing the work of the institution 
members also use "casual talk," "repair" mechanisms that aid understanding, 
preference forms (e.g. agreement) and other devices that are fundamental to 
all conversation, not just that in work and professional settings. Hence, 
background information obtained ethnographically is unlikely to be of help in 
elucidating concrete language practices that accompany and underlie various 
business or professional tasks. For this reason, conversation analysts rarely 
rely on ethnographic data and instead examine if and how interactants them- 
selves reveal an orientation to institutional or other contexts (Sacks 1972, 
Schegloff 1987b, Wilson 1991). 

Conversation analysts who study occupational settings are thus primarily 
concerned with the sequential organization of the talk in them, and this 
concern has taken a variety of forms. One strategy has been to explicate the 
turn-by-turn assembly of particular activities, including requests for police 
assistance (Zimmerman 1984, Whalen & Zimmerman 1987), accusation 
sequences in trial discourse (Atkinson & Drew 1979), bargaining sequences 
in plea negotiations (Maynard 1984), agenda-setting in business settings 
(Anderson et al 1987, Boden 1991), questions and answers in job (Button 
1987) and news interviews (Clayman 1988, Greatbatch 1986), the delivery of 
diagnostic news (Maynard 19892), and other phenomena. 

Another conversation analytic strategy is to describe the distinctive turn- 
taking systems that organize such institutional settings as trials (Atkinson & 
Drew 1979), classroom lessons (McHoul 1978, Mehan 1979), news in- 
terviews (Greatbatch 1988), mediation hearings (Garcia 1988), and con- 
gressional hearings (Halkowski 19902). These studies are indebted to Sacks et 
al's (1974:729—30) observation that ordinary conversational turn-taking is a 
baseline among speech exchange systems, against which other forms of talk 
can be compared. Closely related to these descriptive studies of turn-taking 
are those that analyze how modifications of the system serve to accomplish 
various bureaucratic and occupational functions, including "shared attentive- 
ness” in courts and other multiparty settings (Atkinson 1979, 1982), “neutral- 
ity" in news interviews (Clayman 1988, Heritage & Greatbatch 1991), and 
designing talk for an "overhearing" mass media audience (Heritage 1985, 
Heritage & Greatbatch 1991). 

Still another strategy of conversation analytic studies has been to examine 
how discrete sequence types from conversation become specialized, sim- 
plified, reduced, or otherwise structurally adapted for institutional purposes 
(Heritage 1984:239—40). Thus, Heritage (1984:280-290, 1985) has compared 
question-answer sequences and the distinctive third-turn responses they evoke 
in various casual and institutional settings, demonstrating that the specific 
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type of setting is recurrently honored and displayed by variations in these 
humble receipt items. Correspondingly, educational testing is in part accom- 
plished through alterations of a basic th-ee-part “pedagogical sequence" 
(Marlaire 1990, Marlaire & Maynard 19°90). In a slightly different vein, 
Whalen & Zimmerman (1987) describe how police call openings are special- 
ized and reduced, vis-a-vis their conversaticnal counterparts, to accomplish a 
focused order of business. Also, Clayman (1989) shows how news interview 
closings are specialized to enable interactants to achieve termination at a 
prespecified point in time. As one last example, Maynard (1991) identifies a 
“perspective display series” that occurs im both ordinary conversation and 
clinical informing interviews, but which appears to have a rigid structure in 
the latter context owing to an institutionally based impetus to deliver a 
diagnosis. Across these studies, comparative analysis with ordinary conversa- 
tion enables investigators to identify what is distinctive about institutional 
discourse and what is generic to interaction as such. 

In general then, ethnomethodological investigators traditionally have con- 
ducted research in institutional settings, bat their analytic strategies differ 
along several lines. Some-studies, revealimg basic phenomena of order, are 
indifferent to prior theoretical depictions of the setting under scrutiny. Other 
studies are institutionally sensitive and aim to contribute not only to 
ethnomethodology but to general sociologi-al understanding of traditionally 
defined arenas of law, medicine, media, and so on. Early ethnomethodologi- 
cal studies, showing a bedrock of organizational activities beneath surface 
processes and products of institutional settings, relied in methodology almost 
exclusively on ethnography. More recent research places spoken interaction 
in the foreground, and differences revolve around the role of ethnography as 
an analytic tool. While Cicourel (1987), Mehan (1989), and others argue that 
comprehending locally produced pieces of talk necessitates attention to the 
wider setting of which it is a part, conversation analysts argue that the wider 
setting is itself best grasped in terms of the momentary production and 
modification of conversational sequences. Rather than seeking background 
knowledge of the context, these investigators compare talk and interaction 
across settings to illuminate what is “interactional” and what is “institutional” 
about a given situation.’ 


"We have only scratched the surface of issues suxounding ethnomethodological studies of 
institutional settings and on the role of ethnography. Far a more thorough consideration of issues 
regarding institutions, see Drew & Heritage, forthcom ng; for another perspective, see Bogen & 
Lynch (1989). For discussions of ethnomethodology and ethnography, see Emerson (1983), 
Emerson & Pollner (1988), Maynard (1989b) and Sharsock & Anderson (1982). For discussion of 
conversation analysis and ethnography, see Goodw-n (1990), Maynard (1984, 1989b) and 
Moerman (1988). à 
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STUDIES OF WORK: THE DISCOVERING SCIENCES 


While other ethnomethodological scholars have been studying cognition, 
describing the perceptual assembly of gestalt contextures, explicating mun- 
dane reasoning, analyzing conversation, or exploring the local organization of 
institutional settings, Garfinkel and his close colleagues (Lynch et al 1983) 
have turned to examine highly technical competencies that comprise aspects 
of mathematics (Livingston 1986) and the natural sciences, including astrono- 
my (Garfinkel et al 1981), biology and neurology (Lynch 1982, 1985, 1988), 
and optics (Bjelic & Lynch 1991). 

If the study of these specialized domains seems to represent a departure, it 
also resonates with several themes and topics of long-standing ethnomethod- 
ological interest, including the concern with taken-for-granted methods of 
reasoning-in-action or situated rationality (Boden 1990a, Garfinkel 1967: 
262-83). In this vein, Garfinkel's initial studies probed the managed 
accountability of inquiry in several work or occupational settings—i.e. coro- 
ners determining cause of death (1967:11—18), jurors deliberating guilt or 
innocence (1967:104—15), and medical personnel keeping clinical records 
(1967:186—207). The current research on scientific work describes with more 
precision those courses of practical reasoning and embodied action that are 
integral to the achievement of some of the great technical discoveries of 
mathematics and science. 

This line of research has a philosophical antecedent in Husserl's phenome- 
nology, but with crucial differences. As we pointed out earlier, Garfinkel 
deemphasized matters of consciousness to bring forward the orderliness of 
concrete activities. Correspondingly, he rejected Husserl's philosophical and 
introspective method of analysis and substituted a commitment to naturalistic 
Observation grounded in a deep familiarity with and, preferably, bona fide 
competence in the discipline under scrutiny. Current studies of work thus 
probe the details of "shop work and shop talk" that form the tangible fabric of 
scientific practice (Lynch et al 1983:233). 

Prior to. these initiatives, the domain of concrete scientific reasoning and 
scientific practice had remained largely unexamined. Because scientific dis- 
ciplines ordinarily require practitioners to carefully document their own 
methods in sufficient detail to permit replication, this may seem surprising. 
However, scientists! descriptions of their practice are notoriously incomplete 
(e.g. Kuhn 1970:136—43, 187-91, Gilbert & Mulkay 1984, Knorr-Cetina 
1981, Knorr-Cetina & Mulkay 1983, Latour & Woolgar 1986). There is 
always "something more" to methodological practice than can be provided in 
even highly detailed instructions and formalized guidelines (Lynch et al 
1983:207), and that "something more" "delimits a field" of previously un- 
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investigated phenomena.? Moreover, while a great many biographical, histor- 
ical, and sociological accounts of scientific development exist, virtually none 
explore the routine "workbench practices" that are the stuff of technical 
competence as a lived experience in cormlucting laboratory experiments, 
proving mathematical theorems, and deploying other technical skills (cf 
Suchman 1987). Accordingly, ethnomethocological studies of science chart 
largely unknown territory. 

A distinguishing feature of scientific Cisciplines is that they generate 
discoveries about the properties of physical or mathematical objects, and 
ethnomethodological studies analyze the foundations of discovering “work.” 
For example, Garfinkel et al (1981), using z tape recording of astronomers at 
work, explicate the courses of inquiry and action through which astronomers 
discovered a pulsar. Instrument readings, wich initially had an indeterminate 
sense, came to indicate this independent p3ysical object through organized 
courses of inquiry. That is, the pulsar's availability to competent astronomical 
observation, its objectivity and public verifiability, rested upon an array of 
situated natural language and bodily practices. Similarly, Lynch (1982, 1985, 
1988) studied biologists preparing, working with, and talking about various 
forms of biological and neurological data. The character of such data, and 
conclusions about their usability or artifactuality, result from constitutive 
forms of shop work and shop talk that accomplish specific tasks. These tasks 
are assembled not according to plans or written accounts, but as ways of 
meeting circumstances, handling trouble, paying attention to other matters, 
using formulae, and so on, that make of competent task performance an 
invariably unique achievement. Further stil, in describing the lived work of 
proving mathematical theorems, Livingston (1986, 1987:86-137) demon- 
strates that schedules of equations and diagrams represent only one part of a 
mathematical proof. Proofs consist of the pairing of such formulae with 
embodied courses of inference and action that are essential to the proof's 
intelligibility, coherence, and technical adequacy. 

Such “pairing” of abstractions and embod:ed activity is difficult to appreci- 
ate without participating in the scientific work itself. A paper by Bjelic & 
Lynch (1991), however, enables readers to experience that sort of technical 
work. Rather than describing a workplace competence—in this case, a 
prismatic demonstration of color phenomena—and perpetuating the limita- 
tions endemic to all abstracted descriptions cf situated action, Bjelic & Lynch 
(1991) provide a minimal set of instructions and allow readers, armed with 
their own prisms, to work through the demonstration for themselves. Readers 
thus acquire a first-hand, indeed handed, acquaintance with the practical 


8See the special issue of Human Studies on "Representation in scientific practice" (Lynch & 
Woolgar 1988). 
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skills (e.g. in manipulating and seeing through a prism, generating color 
displays in a specific sequential order, and so on) that alone achieve an 
objectively workable demonstration. Moreover, by going through two in- 
consistent theories of color (those of Newton and Goethe), readers are able to 
grasp intimately the way in which phenomena consistent with alternative 
spectral realities may be produced through appropriate manual, visual, and 
textual procedures. 

In an intriguing way, this recasts the very nature of scientific discovery. If 
worldly objects of technical inquiry are irremediably intertwined with embod- 
ied practices, then scientific discoveries may have a dual character. In addi- 
tion to being discoveries of physical or mathematical objects, they require 
innovative, contingently accomplished, and particularized courses of practical 
reasoning and embodied action through which scientific objects may be made 
demonstrably available and intersubjectively accountable. A radical character 
of these ethnomethodological investigations is that, if adequately specified 
with detail and precision, they may eventually yield findings that represent 
recognized contributions to the scientific disciplines under examination, even 
as they further sociological understanding of the depth of detail involved in 
social organization and order. 


CONCLUSION 


*Ethnomethodology is here to stay,” as Boden (19902) has put it, and has 
even moved to the suburbs (Pollner 1991), although the practitioners often 
remain more ghettoized than suburbanized. Nevertheless, we are seeing 
increased numbers of ethnomethodologically inspired studies in prominent 
journals, edited collections, and monographs. These studies are influencing 
various streams of sociological theory, social psychology, the sociology of 
Science, law and medicine, the sociology of social problems and deviance, 
and other substantive areas in a wide variety of disciplines. 

Still, old labels die hard. Conjectures about whether ethnomethodological 
studies are idealistic or empiricist, subjective or neopositivist, conservative or 
liberal (or even radical), symbolic interactionist or phenomenological or 
atheoretical, and so on, no doubt will persist.? So will the belief that 
ethnomethodology is individualistic and rebellious. It is not only that such 
conjectures miss the mark, but that they impose a uniformity on the area that 
simply does not exist. We have shown the theoretical, methodological, and 
empirical diversity within ethnomethodology, and highlighted what its prac- 


?For summaries and citations to articles that pose these labels for ethnomethodology, see 
Boden (19902), Heritage (1987), Maynard (1986), and Rawls (1985). 
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titioners see as crucial issues. Our own categories have necessarily left out 
some streams of research, condensed irtricate arguments, and refined 
nuanced portrayals of the orderliness to everyday life. As just one further 
piece of the ethnomethodological fabric, ccnsider how ethnomethodologists 
have applied their expertise to extend previous theory and research that is 
ethnomethodologically based into new domains such as gender relations 
(West & Zimmerman 1987) and feminis inquiry (Smith 1937), politi- 
cal oratory (Atkinson 1984, Heritage & Greatbatch 1986), ceremonial activ- 
ities (Mulkay 1984), social problems (Mzynard 1988; Holstein & Miller 
1989), survey interviews (Schaeffer 1991, Suchman & Jordan 1990), and 
to professional settings, such as education (Heap 1990) and medicine 
(Frankel 1983, Beckman & Frankel 1984). where remedies to the troubles 
people experience (Jefferson 1988, Jefferson & Lee 1981) may be sought or 
offered. 

Thus, ethnomethodology transcends disciplinary boundaries and runs the 
gamut from pure to applied research. Overall, as ethnomethodology con- 
stitutes and reconstitutes itself, and further anforms and influences a range of 
social science inquiries, it seems clear that tle diversity of ethnomethodology 
will continue to proliferate. Surely, the theory of an already extant and yet 
achieved orderliness to everyday activities, the phenomenological sensibility, 
the concern for language use, and other themes, will be shared among those 
under the widening ethnomethodological umbrella. Just as surely, debates 
about the role of cognition in ordinary interaction, the differences between 
ethnomethodological and conversation analytic approaches to talk, the use of 
ethnography, the relation of discourse to its institutional context, whether 
studies in institutional settings are properly indifferent or sensitive to tradi- 
tional ways of understanding such settings, end other issues, will also endure. 
Garfinkel's work (e.g. 1988) may then provide a reminder that there are 
always “more and other" aspects to the phenomena of everyday life, as- 
pects that, in abstract dialectics cannot be argued, imagined, or hypothe- 
sized to exist, or even reduced to "practices" and "methods" of whatever sort. 
Organizational features of everyday life ere only discoverable, and thus 
require enduring contact with society's members, an appreciation of the 
courses of action they inhabit, and studied involvement in their mundane 
tasks. 
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Abstract 


Until recently little theoretical or empirical attention has focused on the ways 
in which socially structured experiences in the workplace affect the in- 
teractions that occur within families. This review considers the many levels at 
which family interaction is currently being studied, presents arguments 
regarding expected relationships between aspects of workplace experiences 
and spousal and parent-child interaction, and summarizes the relatively small 
body of empirical research that links occupation and family interaction. It 
emphasizes the extent to which emotional consequences of work mediate the 
effect of workplace conditions on family interaction. The chapter reviews 
evidence suggesting that a variety of workplace conditions—restriction of 
opportunity to exercise self-direction, work overload, poor quality of in- 
terpersonal relations on the job, low opportunity for cooperative problem- 
solving, job insecurities, job loss, and low earnings—have emotional reper- 
cussions that have negative implications for family interaction. It argues that 
future research can illuminate social influences on family interaction by more 
fully incorporating observational methods to investigate both workplace and 
family processes and by more systematically and explicitly bringing the larger 
Occupational and economic context into the emerging analyses of the micro- 
structure of family interaction. 
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INTRODUCTION 


Adults participate in both occupational life and family interaction, and social 
scientists study both. Yet until recently surprsingly little theoretical or empir- 
ical attention has been paid to the ways in which socially structured experi- 
ences in the workplace affect the interactionz that occur within families. This 
brief review emphasizes research that examines how and under what con- 
ditions adults’ employment experiences affect interactions between spouses 
and between parents and children within farrilies. I review the many levels at 
which family interaction is currently studied. present arguments regarding the 
expected relationships between aspects of workplace experiences and spousal 
and parent-child interaction, and summarize the relatively small body of 
empirical research that links occupation and family interaction. I argue that a 
key pathway by which workplace conditions affect spousal and parent-child 
interaction is through the emotional consequences of work for individual 
workers. Finally, I make some recommendations for future work. 

The scope of this chapter is relatively narrow, but several more general 
treatments of work and family issues have recently appeared, including 
reviews of paid employment and its reletion to unpaid domestic work 
(Thompson & Walker 1989, Ferree 1990), of relations between occupation 
and personality (Spenner 1988), and of relations between work and family 
(Menaghan & Parcel 1990, Piotrkowski et al 1987, Spitze 1988, Voydanoff 
1987, Huber 1988, Eckenrode & Gore 1990z,b). Useful general introductions 
to family interaction processes and their measurement include Aldous (1977) 
on family interaction patterns, Bakeman & Gottman (1986) on observing 
interaction, Fitzpatrick (1988) on spousal intzraction, and Patterson (1990) on 
depression and aggression in family interaction. 


INNOVATIONS IN THE ASSESSMENT OF FAMILY 
INTERACTION 


To learn about family interaction, either we zan ask people to tell us about it, 
or we can observe them interacting. In each zase, the researcher's job is to try 
to make sense of what family members repcrt or what s/he observes in terms 
of a more limited set of dimensions. The dimensions usually examined by 
researchers tend to focus on the positive/megative tone of interaction; the 
relative emphasis on cohesion and togetherness versus separateness and in- 
dependence; and the degree of flexibility in structure or organization that 
characterizes family routines and family interaction. For example, David 
Olson's (Olson et al 1983) influential typology contrasts families on two 
major dimensions: cohesion, which ranges from enmeshed (excessively 
bound together) to disengaged (excessively separate), and structure, which 
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ranges from rigid (extremely firm, unchangeable organizational rules) to 
chaotic (excessively loose patterns which provide little predictability to family 
experience). As is evident from the labels attached to the extremes of these 
dimensions, the theoretical argument is that intermediate positions on both 
dimensions are most effective and personally satisfying. In contrast, research 
on positive/negative emotional tone emphasizes the advantages of positive, 
respectful, and supportive words and actions, and perhaps even more strong- 
ly, (see Gottman & Levenson 1986) the dangers of negative verbal and 
nonverbal behavior, rather than viewing some intermediate emotional tone as 
ideal. 

For marital interaction in particular, researchers have also been concerned 
with the relative power of spouses in decision-making and interaction. Here 
too, an intermediate position in which both husband-dominated and wife- 
dominated patterns are avoided seems optimal. Both clinicians and interacting 
individuals judge less positively the highly competitive patterns marked by 
asymmetrical dominance (see Grey-Little & Burks’ 1983 review). Study of 
parent-child interaction has also been concerned with power, but it has 
focused more on strategies for child socialization, contrasting active efforts to 
alter child behavior with more laissez-faire permissive approaches and dis- 
tinguishing between inductive and coercive strategies to gain child com- 
pliance (see Peterson & Rollins 1987 for reviews of this literature). The 
parent-child literature has also emphasized the extent of parental involvement 
and the emotional tone of interaction, contrasting positive support with 
hostile, critical, rejecting behavior. 

These dimensions have been measured both by self-report and coding of 
observations. A large number of self-report inventories have been developed 
asking about spousal interaction, parent-child interaction, or interaction of the 
whole family. While earlier self-report measures ask respondents to mentally 
average everyday occurrences and produce a description of their usual pat- 
terns, more recent work has collected multiple daily (e.g. Bolger et al 1989, 
1990) or more frequent (e.g. Csikszentmihalyi & Larson 1984) reports from 
the same individuals. Such repeated measures may expose neglected sources 
of variation in "usual" patterns. For example, Crouter & McHale (1990) 
highlight variations by season both in maternal work involvement and in the 
relations between work and family life. As is discussed further below, such 
reports can be averaged to construct a more reliable measure of usual patterns, 
or the focus can be on understanding patterned variations over time in 
individual reports as a function of other factors. Of course, such reports tell us 
about the participants' own construction of the relationship and the interaction 
that occurs. Reports of individual family members need not be congruent, nor 
may they match the ratings of an "outside" observer. 

Direct observation of family interaction has become increasingly frequent 
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and increasingly sophisticated. Observation studies tend to focus on di- 
mensions similar to those used in self-report studies, with particular stress on 
the emotional tone characterizing the interaction and the way that families 
members attempt to resolve conflict. The number of family members studied 
and the settings in which they are observed vary. While some early and 
provocative studies observed families’ everyday routines in their own homes 
(e.g. Kantor & Lehr 1975, Piotrkowski 1979), studies of couples' and 
families’ interactions more often use laboratory settings and some structured 
stimulus for interaction (e.g. Raush et al's 1974 study of newly married 
couples and Reiss's 1981 study of families with adolescent children; but see 
Godwin & Scanzoni's innovative 1989 study of taped interactions at home of 
188 randomly selected married couples, and Conger et al's 1990 observations 
of spousal interaction in 76 rural midwestern couples). Observations can be 
coded to produce overall ratings of the family as a whole or measures 
summarizing the behavior of individual family members. A recent example of 
the former is Wampler's Georgia Family Q-Sort (Wampler et al 1989). In this 
measure, trained raters summarize the most salient aspects of a family's 
Observed interaction by sorting 43 descriptors (selected to represent di- 
mensions of cohesion, structure, and communication) into a 3-4-5-6-7-6-5-4- 
3 distribution, from those that are least descriptive of this family to those that 
are most like this family. Summarizing the behavior of the individuals 
involved in an interaction emphasizes instead such measures as the distribu- 
tion of each individual's responses on a set of categories or the proportion of 
total turns taken by a given family member. 

Such global indicators and their correlates demonstrate the importance of 
overall patterns. As studies of self-report measures have also shown, families 
"doing better" are characterized by more positive affect, less negative affect, 
and less tension and reserve, as well as greater organization and negotiation in 
problem-solving (Wampler et al 1989). Similarly, numerous studies indicate 
that both spouses in unhappy relationships are characterized by higher than 
average rates of negative verbal statements (for review, see Baucom & Adams 
1987). But such findings do not illuminate how individual actions and reac- 
tions, statements and rejoinders, work to sustain or undermine more global 
features of interaction. How is it that a mother and son begin by feeling 
positive or neutral toward one another and end up in rage or tears? How do 
couples facing a major disagreement get from there to a mutually accepted 
resolution that leaves both partners feeling positive about their spouse and 
good about themselves? 

To answer such questions, studies have moved to more fine-grained analy- 
ses of sequences of behavior and the contingencies among specific behaviors. 
Pioneers in such studies include Raush et al (1974) and their program of 
observations of husband-wife improvisations in researcher-constructed con- 
flicts. They found that couples tend to reciprocate the emotional tone and 
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direction of the previous uttérance, with task-oriented, emotionally neutral 
remarks followed in kind, emotionally supportive comments also recipro- 
cated, and negatively charged comments such as attacks, attributions of 
selfish motives, or rejection tending to entrap a couple in a vicious cycle of . 
reciprocated negatives. The reciprocal pull of negative comments was more 
powerful than positive or neutral ones. Among the newlywed pairs studied by 
Raush et al (1974), husbands were somewhat more apt to break free of a 
negative spiral, while wives were somewhat more likely to use negative 
emotional pressure. 

The most visible and comprehensive current programs of such research are 
those of John Gottman and his colleagues on spousal interaction and Gerald 
Patterson and his colleagues on mother-son interaction. Gottman (Gottman & 
Levenson 1986; see also Gottman et al 1977) describes similar reciprocity in 
negative affect and alternating accusations of blame, but in contrast to Raush, 
he finds that wives were more likely to break cycles of negative affect and 
suggests that wives are generally more able to manage negative emotions and 
their physiological manifestations than are husbands (Gottman & Levenson 
1987). How such patterns emerge and how they are maintained are important 
questions. Patterson and his colleagues have applied a social learning perspec- 
tive to understanding parent-child interaction and, in particular, boys’ aggres- 
sive and delinquent behavior. In observations of mother-son interaction, 
Patterson studies the contingencies that are established. Patterson argues that 
mothers inadvertently “train” their sons to persist in aggressive, coercive 
patterns by reinforcing those behaviors (Patterson 1982, 1988, Patterson et al 
1989). 

A recent innovation in the study of sequences of observed behaviors is to 
supplement the ratings of trained observers with simultaneously collected 
physiological measures. John Gottman and his colleagues have pioneered the 
collection of extensive physiological data during marital interaction, includ- 
ing use of chairs affixed to platforms with movement sensors (see Gottman & 
Levenson 1987). Gottman and collaborators have also extended the study of 
children's physiological states using space-ship-like settings that constrain 
children's movement without eliciting resistance (Gottman & Katz 1989). 
These physiological data are entered into computerized tracking systems that 
link them in real time with other coding of ongoing interaction. In this way, 
researchers can examine the emotional and physiological concomitants of 
various sequences of behavior and begin to move beyond analyses of behavior 
per se to better understand its emotional and physiological underpinnings. 

Observed marital interaction occurs in the context of a history of interaction 
that is unknown to the scientific observer. Thus, the observer may mis- 
understand some utterances and may be surprised at the verbal or physiolog- 
ical responses they elicit. Some recent research uses self-report data in a novel 
way to better understand observed interaction: The family participants view 
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videotapes of their own interaction and provide ongoing codes of their 
interpretations of their own and their spouse's statements. Analyses of the 
spouses' own interpretations and attributions helpfully supplement the naive 
observer’s inferences regarding the kind of statement that was made and how 
the spouse reacted to it. 

Other researchers have combined these two innovations, by collecting 
additional physiological data from spouses as they view their own previously 
recorded interaction and evaluate it. Gottman reports that observers physio- 
logically relive the earlier experience, producing data that closely parallel the 
original record of emotional and physiological responses. The advantage of 
the participants’ commentary, of course, is that the emotions, interpretations, 
and attributions of the interacting members can be explicitly and systematical- 
ly brought into the analysis, rather than relying on researcher inferences and 
speculations that “It’s almost as if X interprets A as meaning C"). 

As Bradbury & Fincham (1990) suggest in their review, individual attribu- 
tions are causally linked to satisfaction/distress. For example, making more 
global, stable, negative attributions regarding one's spouse's behavior is 
causally linked to greater dissatisfaction with the marriage. Bradbury & 
Fincham also report that the most dissatisfied couples rate outside circum- 
stances as least important in causing negative events. By contrast, nondis- 
tressed spouses rate outside circumstances as being most important (1990, p. 
18), consistent with their view of negative events as linked less often to 
global, stable deficiencies of their spouses and more often as a reflection of, 
for example, negative experiences in other roles. Bradbury & Fincham articu- 
late the linkages between attributions, the emotional effects of attributions, 
and the roles that both attribution and emotion play in shaping observed 
marital behavior and reported marital satisfaction. 

Thus, direct study of individual attributions promises more complete ex- 
planation of the sequences of observed behavior that shape marital distress. 
These innovations help to bridge the gap between self-report and observation 
traditions in the study of family interaction. Other researchers have further 
expanded the field by combining individual interview data from multiple 
family members with observer-rated interaction to build more comprehensive 
explanations of individual distress and dissatisfaction with relationships (God- 
win & Scanzoni 1989, Conger et al 1990). Such studies highlight the contri- 
butions of each individual's characteristics to his or her ratings of self and 
spouse. As Lorenz et al (1991) remark, findings using such multiple sources 
of information may differ from those obtained from single-informant re- 
search, which is typically plagued by high correlations among conceptually 
distinct concepts (see also Bank et al 1989, regarding such “glopping” 
together). Multimethod, multirespondent investigations, of course, are highly 
time-intensive and expensive, and too often are hampered by small sample 
sizes. Survey researchers have been enormously aided by the development of 
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large omnibus surveys which become public-use data sets available to all 
members of the research community. A parallel effort to collect and archive 
multimethod data sets that include direct observation of family interaction 
: ps is Still in the future. 


EXPLANATIONS FOR FAMILY IER 
PATTERNS 


Some of the most exciting theoretical work on family relationships has 
proposed more ecological (Bronfenbrenner 1979, 1986) and life course (Elder 
et al 1984, see also Bolger et al 1988) perspectives that link individual 
behavior and outcomes in one set of social roles to experiences in other roles 
and to historical and social context. Yet when one considers current methods 
for studying family interaction and the explanations proposed, it appears that 
the expansion of methods to include close attention to sequences of behavior 
has inadvertently reduced the scope of explanatory variables considered. 
Often, the total net of explanatory variables and hypothesized processes is 
limited to those operating within the interaction. For example, studies de- 
scribe how happily and unhappily married couples differ in their sequences of 
behavior or their attributions; these interactional contingencies are treated as 
explanatory variables that account for the reported unhappiness of marital 
partners. The researchers! very success in documenting these important 
effects of interaction patterns makes it all the more important to investigate 
just as carefully and systematically the social factors external to the interac- 
tion in itself that shape better and worse interaction patterns. 

But such linkages are not made. For example, Bradbury & Fincham's 
(1990) integrative model of attributions and marital satisfaction posits that 
individual attributions are causally dependent on the behavior and in- 
terpretations of one's partner and oneself that occur within the interaction. No 
factors external to the marriage are included. Similarly, Fitzpatrick (1988) 
distinguishes several major models to explain marital interaction, including 
social exchange models which emphasize the ratio of rewards to costs within 
interactions; social learning models, which emphasize intended and in- 
advertent social reinforcement of particular behaviors in ongoing interaction; 
structural models which emphasize the symmetry and predictability of se- 
quential patterns in interaction; and arousal/emotion theories, which empha- 
size the unpleasant physiological arousal that occurs when individuals feel 
themselves under threat. None of these models moves outside the interaction 
itself to consider individuals' experiences in other social roles or relationships, or 
the subjects' place in the larger system of social stratification. Indeed, except for 
gender, no indicator of such experience is brought into analyses, and gender itself 
tends to be discussed in terms of putative biological tendencies rather than 
socialized differences in experience and interpretation. 
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Patterson's extensive research on boys' aggression acknowledges that ex- 
ternal stressors increase parental difficulties in being consistent and effective 
with their children (Patterson et al 1989), but even here the focus of study, 
and intervention, is the interaction itself. Thus, the way to "fix" unsatisfying 
family interaction patterns is to decrease unhelpful attributions of individuals, 
to decrease individual tendencies to engage in counter-productive behaviors, 
or to avoid reinforcing aversive child behaviors. 

Of course, it would be an overstatement to claim that explanatory factors 
are completely restricted to events within the interaction; but the researchers 
do seem to have trouble in getting outside of the family. For example, Elder's 
influential studies of children and adolescents in Depression-era families call 
attention to the important role that personal dispositions, measured prior to 
economic change, play in qualifying and tempering the negative effects of 
economic change on parent-child interaction and child behavior problems. A 
more positive and harmonious relation between husband and wife in the time 
prior to the economic loss had a similar tempering role. Economic hardship 
seemed to accentuate existing individual and marital vulnerabilities, while 
individual and interpersonal strengths were surprisingly powerful in warding 
off potential negative effects of economic loss (see Elder & Caspi's 1988. 
summary of this research, and Liem & Liem's 1990 research on current 
unemployment that reports similar patterns). Elder is careful to note, howev- 
er, that even the most highly rated marriages exhibited increased tension 
under the stress of economic deprivation (Liker & Elder 1983). More recent 
studies of families caught in the farm crisis of the 1980s echo these findings 
regarding families caught in the Great Depression: In each case, economic 
hardship tended to increase husband's negative, hostile, and irritable behavior 
and to decrease the warmth and supportiveness of their interaction with their 
wives. These negative changes in behavior tended to erode their wives' 
marital satisfaction and led to increased thoughts of divorce (Conger et al 
1990). 

The powerful role of such personality vulnerabilities prompts attention to 
their antecedents. Adult personal disposition is traced back to one's own 
parents' irritability, and their ineffective or explosive parenting (Caspi & 
Elder 1988, Elder & Caspi 1988). These linkages are carefully examined, and 
they are surprisingly strong, even across four generations (Elder et al 1986), 
but the total net of explanatory constructs considered never moves beyond the 
past and present family. Similarly, studies of child behavior problems con- 
sistently document their link to marital discord and disruption (Emery 1982, 
Peterson & Zill 1986; see Gottman & Katz 1989) without going any further in 
their explanatory search. Researchers are more apt to focus on how marital 
discord influences parental behaviors, but are less likely to pursue the stress- 
ors impinging on marriages that make both discord and disruption and less 
effective parenting more likely. 
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This renewed emphasis on explaining family interaction from within and 
from past family interaction is ironic, given past work. The pathbreaking 
studies of families in the Great Depression and under the stress of global war 
made it clear that societal events, as well as individual experiences outside the 
family, have implications for individual well-being and family interaction that 
cannot be derived from analyses that treated the family as a closed system 
exempt from other influences (see for example Cavan & Ranck 1939, Hill 
1949, Elder 1974). Certainly, as my review indicates below, the extensive 
work on men's unemployment leaves little doubt that it pervasively affects 
men's emotions, sense of self, and interactions with others, including wives 
and children. Yet the importance of economic and occupational experiences 
for family interaction patterns is not integrated into much of the most sophisti- 
cated and methodologically innovative current research in family interaction 
(see Bronfenbrenner's 1986 summary of the sparse literature that has linked 
external environments to family functioning). As a stimulus to such work, this 
chapter outlines several ways in which occupational and economic experi- 
ences may influence individual and family propensities to act. 


Understanding the Linkages between the Workplace and the 
Family z 

It is an easy thing to posit that individual experiences outside the family have 
an influence on subsequent family interaction. It is more difficult to identify 
processes by which such effects occur and methods for examining such 
processes. Nevertheless, implicit in the existing work on family interaction is 
a model of at least one set of processes by which both family and external 
circumstances should play a similar causal role in initiating or maintaining 
more problematic interactions among family members. For example, Raush et 
al (1974) describe a dynamic to explain variations within marital interaction 
that is quite similar to that described in Piotrkowski’s (1979) explanation of 
variations in responsiveness to occupational stressors, in Gottman’s (1979) 
account of responses to interpersonal conflict, and in Downey & Coyne’s 
(1990) description of the impacts of emotional distress and clinical depres- 
sion. This set of processes involves negative emotional arousal, which 
prompts responses that are intended to salvage the individual sense of esteem 
and-competence but that have negative long-term interactional consequences. 
Many other experiences, including racism, sexism, economic insecurity, and 
assault may evoke similar responses. 

In all cases, an event or situation (mediated by individual interpretations) 
increases the individual's feelings of threat and uncertainty. S/he is threatened 
with loss of self-esteem, loss of a sense of control over what is happening, and 
loss of feelings ot competence. As these feelings accumulate or intensify, the 
individual's own behavioral repertoire becomes less flexible, less nuanced, 
and less positive. Raush argues that individuals bring to interactions sets of 
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cognitive schemata of images of self and others that have evolved over time in 
multiple relationships and contexts; ongoimg interactions tend to be assimi- 
lated to these established interpretive structures (see also Elder & Caspi's 
(1988) discussion of interactional continuitz). When anxiety-associated sche- 
mata are aroused, the individual's actions become less flexible and differenti- 
ated. As feelings of threat mount, options shrink to variants of fight (attack) 
and flight (escape). 

Gottman discusses the same phenomencn in terms of negative emotional 
arousal, noting both the sense of individual discomfort and the ways in which 
discomfort tends to interfere with individual and cooperative efforts to man- 
age arousal within ongoing interaction. He argues that negative affect and the 
release of stress-related hormones such as norepinephrine, epinephrine, and 
corticosteroids may interfere with higher order cognitive functions such as 
problem-solving, planning, and creative thinking. He says, "Such a reduction 
in the flexibility of cognitive functioning may result in a reliance on automatic 
and over-learned cognitive routines, and thus a tendency to resort to well- 
rehearsed, but highly maladaptive behaviors that adversely affect the couple's 
ability to resolve conflicts" (Gottman & Levenson 1987:191). In a con- 
troversial extension of this idea, Gottman »osits that men may have a lower 
threshold for negative arousal and a greater lag in returning to baseline. These 
differences may account in part for observed gender differences in family 
interaction (Gottman & Levenson 1987). 

The same processes can occur in parent-child interaction. In a thoughtful 
review aimed at distinguishing parent-child interaction patterns among clini- 
cally depressed parents from those of other parents, Downey & Coyne (1990) 
suggest that the parenting behaviors observed in clinically depressed mothers 
resemble those of mothers worn down by sach external stressors as economic 
pressures. The authors describe a scenario strikingly reminiscent of Patter- 
son's ineffective mothers. Preoccupied and low in energy level, these mothers 
exert less effort in interaction, including less total positive affect and less 
contingent positive affect. When faced with child noncompliance or mis- 
behavior, they choose strategies that require less cognitive effort and less 
interpersonal negotiation: They are more likely both to enforce obedience 
unilaterally or, alternatively, to withdraw or give up when faced by child 
resistance. This reduced set of choices produces both the negative, punitive 
parenting and the parental inconsistencies -hat have been noted as problema- 
tic. Downey & Coyne comment that distressed/depressed mothers, by their 
own example in opting for conflict resolution strategies that are less cognitive- 
ly effortful, may socialize their children. to resolve interpersonal conflict 
through coercion or withdrawal. 

A wide variety of circumstances may erouse feelings of threat regarding 
one's own safety, competence, and worth Marriage and parenting demands 
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themselves are potent sources of feelings of threat and inadequacy. But so too 
are adult occupational and economic experiences, which are an overlooked 
source of individual attributions that constrict behavioral choices within 
family interactions. Piotrkowski (1979) describes emotional responses to 
occupational stressors. She emphasizes how feelings of safety and com- 
petence at work increase workers' emotional and interpersonal availability at 
home, while anger generated by relatively low power and blows to self- 
esteem at work carry over in two predictable forms: expressing negative affect 
or irritability to family members (fight), or trying to withdraw from family 
interaction in order to recuperate (flight). 


OCCUPATIONAL AND ECONOMIC EXPERIENCES 
AND FAMILY INTERACTION 


A limited body of research provides some empirical underpinnings to these 
ideas. Linkages are observable at several levels. Some research looks at stable 
features of occupations likely to prompt a chronic sense of anxiety and threat, 
while other work considers fluctuations in occupational stressors that may 
elicit episodes marked by negative emotional states. The most common 
design uses individual reports of both usual conditions at work and usual 
interaction patterns at home. By necessity, these are aimed at relatively 
stable, persistent situations, typical conditions encountered and typical re- 
sponses. The analyses contrast individuals with differing and presumably 
relatively stable levels of occupational problems and differing interaction 
patterns. Studies of fluctuations collect repeated measures of occupational and 
family experiences and look at daily variations within individuals relative to a 
baseline calculated from their own average level. For example, when “worse 
than average" work days are followed by "higher than average" conflicts at 
home, "worse" is defined in terms of the individual's own average, not a 
sample’s-average. Such studies hold constant rather than analyze the varia- 
tions across occupations that may be related to variations in baselines. 
There is virtually no methodologically sophisticated study of linkages 
between workplace experiences and sequences of observed behavior within 
family interaction. As noted above, such studies tend to locate the source of 
the patterns within the dyad or group observed. Yet to be realized are studies 
that look at how interaction, or individual emotional or physiological reactiv- 
ity, at the more molecular level differs after a “good” work day rather than a 
"bad" one. Similarly, thus far, studies of how individuals interpret their 
spouses' or their children's behavior do not search outside the dyad for the 
causes of these differences, although individuals bring to their interpretation 
of interaction the problems or successes that they have experienced else- 
where. Thus, we would expect that a woman at odds with her boss all day is 
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more likely to read opposition into her spouse's responses as well. Equally, 
the man anxious and uncertain about his performance on the job may be 
hypervigilant regarding criticism from his spouse or children. While some of 
the studies reviewed are consistent with this idea, direct tests are still needed. 


Variations in Occupational Experiences 


Among employed workers, large variations occur in occupational conditions. 
Four aspects of working conditions have been emphasized: the extent of 
self-direction permitted; the amount of work demanded; the quality of in- 
terpersonal relationships with supervisors, coworkers, and subordinates; and 
the degree of job uncertainty and insecurity. We review this literature with an 
eye toward establishing how these various aspects of work may affect an 
individual's sense of safety, esteem, and competence, and thus impinge on 
family interaction. 


OCCUPATIONAL SELF-DIRECTION Work socialization theorists emphasize 
the opportunities for self-direction that a job offers and argue that greater 
opportunity to exercise self-direction at work improves cognitive habits and 
skills (Kohn & Schooler 1982, 1983, Miller et al 1985, Naoi & Schooler 
1985, Schooler & Naoi 1988). It is also argued that the degree of self- 
direction exercised on the job shapes more general attitudes and values. 
People are believed to generalize attitudes, values, and approaches learned or 
encouraged on the job to other settings as well, including family life (Schooler 
1987). The process recalls the role-person merger (Turner 1978) in which the 
attitudes and behavior required in one social role are displayed in other 
settings as well. Of course, Turner argues that such merger is more likely 
under some conditions than others; empirical research, which tends to look 
only at main effects, finds relatively weak overall effects. Nevertheless, 
job-influenced changes in cognitive skills and attitudes have been shown to 
influence the way that individuals spend their time off the job (Miller & Kohn 
1983). Lower opportunities for self-direction also shape more socially con- 
servative and guarded orientations to society (Kohn et al 1990; see also the 
comprehensive reviews of O’Brien 1986 and Spenner 1988). 

Such effects have implications for the values that parents hold for their 
children (Pearlin & Kohn 1966, Pearlin 1971, Kohn 1977, Kohn et al 1983, 
1986) and for the way that they interact with them. More substantively 
complex job conditions have been linked to valuing more self-directed quali- 
ties in one’s children and to encouraging them to be more autonomous and 
intellectually flexible (Gottfried & Gottfried 1988, Piotrkowski & Katz 
1982). Mothers in jobs involving more substantively complex work also 
provide more cognitive stimulation and affective warmth to their young 
children (Menaghan & Parcel 1991); and these better home environments are 
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associated as expected with better verbal development (Parcel & Menaghan 
1990) and fewer behavior problems (Rogers et al 1991). Parental values also 
influence their children's values, according to data from both the United 
States and Poland (Kohn et al 1986). 

It is interesting that studies of occupational self-direction and its effects 
have tended to look either at the workers’ own well-being or at approaches 
and values relevant to child-rearing. Little theoretical or empirical work has 
examined the impact of occupational self-direction on marital goals, values, 
or behaviors. Some evidence does suggest, however, that job conditions 
permitting greater self-directedness and complexity do affect spousal interac- 
tion. In a study of workers in a “participatory” milieu involving greater 
teamwork and more complex and diversified tasks, Crouter (1984) reports 
that such work activities “carry over” into more problem-solving efforts with 
spouses and children, as well as to a greater sense of control and competence. 
While such effects are generally considered positive, some working wives 
reported that their greater assertiveness in problem-solving was not entirely 
welcome by their husbands, for whom it seemed to threaten previously 
established patterns. 

Workplace socialization research on the effects of opportunity for self- 
direction and judgment emphasizes individual cognitive outcomes and paren- 
tal values, goals, and intentions, but the research does not develop links to 
affective aspects of interaction. Empirical research suggests, however, that 
the effects of such self-directing work conditions (or their absence, as in- 
dicated by simple tasks, repetitive work, high supervision, and little interac- 
tion or negotiation with others) go beyond cognitive skills and attitudes to 
affect the sense of self-worth, self-efficacy, and overall emotional well-being 
(Gecas & Seff 1989, Hibbard & Pope 1987, Lennon 1987, Menaghan 1990, 
Mortimer & Borman 1988, Mortimer et al 1986, Spenner & Otto 1985). It is 
by such pathways that occupational self-direction may lead to greater autono- 
my and problem-solving with family members in interaction. Studies of other 
occupational conditions have looked more directly at emotional con- 
sequences, documenting the stressful potential of high job demands, un- 
certainty about job tenure, and unpleasant or unsupportive workplace rela- 
tionships. 


OVERLOAD Much research has focused on the extent to which job demands 
are excessive and wearying. Bolger et al (1990) and Repetti (1989) study the 
amount of work demanded and the extent to which it constitutes an overload, 
while Rosenfield (1989) suggests that job and child-rearing demands combine 
to reduce adults’ sense of control. Rosenfield shows that the resultant reduc- 
tions in perceived control increase distress. These studies emphasize mis- 
matches between occupational demands and individual capacities or resources 
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that may produce either role overload or boredom and understimulation. 
Work by Karasek and others (Karasek 1972, Karasek et al 1981, Haynes & 
Feinleib 1980, House et al 1986) suggests that the combination of high job 
demands but low decision latitude, which was more characteristic of women's 
jobs than of men's jobs, is associated witn greater emotional and physical 
distress. Time pressures and long work hours (Miller et al 1979, Voydanoff & 
Donnelly 1989) are also consistently linkec to greater anxiety. These experi- 
ences may prompt a sense of anxiety aboat one's competence. 

Some studies have examined daily vaciations in job stress, emotional 
response, and family interaction. Repetti (1289) studied 33 air traffic control- 
lers and 27 of their spouses who completed daily surveys on three consecutive 
days. When spouses were supportive following high workload days, the 
heavy workload appeared to reduce overall social and emotional responsive- 
ness at home, and to decrease aggressive behavior. She speculates that social 
withdrawal within the family may promote recovery from negative work 
experiences. Repetti's. findings suggest that social withdrawal is one avenue 
for individual recovery from occupational stressors and protection of family 
interaction from negative job stressor effects; but Piotrkowski (1979) notes 
the costs this imposes on other family members. Since such withdrawal is 
possible only if other family members are available to pick up the slack and 
are willing to do so, this avenue is not zs likely to be available to single 
parents as to married ones. Crouter et al (1989) studied how fathers' reports of 
stress fatigue, depression, and alertness after work were related to home and 
family activities that evening. Consistent with Repetti, they found that higher 
work stress reduced involvement in the household and in active leisure 
activities and increased negative marital interaction (as reported by the wife). 

Bolger et al (1989) used daily diaries from both husbands and wives for a 
six-week period to examine how stressful work experiences (overloads and 
arguments with supervisors, co-workers, or subordinates at work) predicted 
feelings of overload, arguments with spcuse, or arguments with children. 
They also found that overload at work was likely to be followed by a 
reduction of home involvement; both home work load and occurrence of 
arguments were lower. When their spouse experienced work overload, re- 
spondent workload at home increased while their spouse's home workload 
decreased, suggesting that respondents were shouldering more of the work for 
a weary spouse. This effect was stronger for wives than for husbands, so that 
husbands were more likely to gain some breathing space at home after a bad 
day at work. 

One indicator of overload may be lorg work hours. While workers in 
low-paying jobs may work longer hours to increase earnings, many high- 
wage occupations also demand extensive tme commitments. More satisfying 
jobs may increase the positive quality of the parent-child interaction that 
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Occurs but limit the amount of this interaction. Grossman et al (1988) report 
that fathers who were more satisfied and involved in their own work spent less 
time with their children than did less work-satisfied fathers. However, the 
interaction of satisfied fathers was rated as more sensitive and more. suppor- 
tive of children's autonomy and affiliation. 

Although much of the concern about longer work hours affecting interac- 
tion with children has emphasized maternal work hours, Nock & Kingston 
(1988) suggest that the amount of mother's time with children is less affected 
by work hours than is fathers’ time. It appears that employed mothers try to 
protect time with children, but time in a couple's shared leisure activities 
diminishes under the constraints of wives’ employment (Hill 1988). Other 
researchers have speculated that when employed mothers maintain high levels 
of interaction with their children, fathers are pushed to the sidelines, making 
their time with children less than in households with non-employed mothers 
(Hoffman 1989). However, neither Lamb et al (1988) nor Crouter et al (1987) 
find this. 

Assessing overload in terms of long work hours is relatively easy. More 
difficult is assessing the extent to which preoccupation with occupational 
triumphs and troubles decreases emotional and interpersonal availability even 
though the employed person is physically present. Both highly ego-enhancing 
and highly frustrating occupations are likely to remain salient for the worker 
even away from the workplace (see Piotrkowsky 1979). Thus, more satisfying 
work may entail both longer work hours and greater work preoccupation, 
which thus undercut its emotional benefits. 


QUALITY OF INTERPERSONAL RELATIONSHIPS Other research emphasizes 
interpersonal relationships at work—both in terms of support from coworkers 
or supervisors (e.g. House et al 1986) and in terms of conflict and arguments 
(Bolger et al 1989). As might be expected, poor interpersonal relationships 
with job supervisors and coworkers arouse negative emotions and increase 
emotional distress (Hibbard & Pope 1987, Menaghan & Merves 1984, Repetti 
1988). In the Bolger et al (1989) daily diary study, interpersonal conflicts 
showed a clear spillover from work to home, with arguments at work likely to 
be followed by spousal arguments at home that evening. Both at work and at 
home, the occurrence of interpersonal conflicts perpetuated distress and 
depressed mood over time (Bolger et al 1989). Again, effects may interact 
with parallel problems at home. Repetti finds that the negative effects of poor 
job social climate and low supervisor support were stronger among the 
married women who also reported more inequality relative to their husbands 
in time spent on household work and care of children. 

Interaction at work may also affect family interaction in more subtle ways. 
Jobs vary in the amount of interaction with others that is permitted, or even 
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required. And while conflict may be less when interaction with others is 
limited, jobs demanding higher involvement with things and little contact 
with people may also have emotional ccsts. In part, these costs may be 
produced by the more visible and direct supervision under which such work is 
accomplished, as well as by the limited opportunities for rehearsal of coopera- 
tive problem-solving or interpersonal negotiation. Although this aspect of 
occupational conditions has not been extensively studied, Rogers et al (1991) 
find that among employed mothers with young children, mothers' lesser 
involvement with people on the job was associated with increased reports of 
under-controlled behavior among their children. This finding would be ex- 
pected if the negative emotions aroused. by such workplace experiences 
produced more coercive and inconsistent parenting (Patterson et al 1989), 
although Rogers et al had no direct measire of mother-child interaction. 

Finally, the gender composition of the workplace also has implications for 
worker emotions and family interaction. M2n and women differ on average in 
the conditions they encounter at work (Pa-cel 1989) since occupations, and 
jobs within occupations, remain segregated by sex, and much female employ- 
ment tends to be in “female ghetto" occupations (secretarial and clerical, retail 
sales, cashiers, beauty/cosmetics, young ckildren’s education). However, sex 
segregation has decreased somewhat (see Bielby & Baron 1984, Reskin & 
Roos 1990, and Jacobs 1989 for varying coaclusions regarding how much real 
change has occurred). In the past, much coworker interaction and coworker 
support occurred within gender groups. Such interaction may support and 
reinforce gender-differentiated perspectives on family problems and gender- 
differentiated conversational styles. As workplaces become more gender- 
mixed, opportunities for male-female inte-action will increase. Such oppor- 
tunities may have mixed effects. They may provide greater opportunities for 
extramarital romantic and sexual involverrents. Additionally, they may con- 
firm the status inequalities inherent in rruch male-female interaction. For 
example, work interactions of such gende--differentiated pairs as male man- 
agers and female secretaries, male doctors and female nurses, male dentists 
and female hygienists, echo the subordinare supporting role of women in the 
larger society. Still, the declines in segregetion, and the increases in women's 
representation in higher status positions, may provide greater male-female 
rehearsal of cooperative interaction among status equals, with implications for 
more egalitarian spousal relationships with n families. On the other hand, less 
job segregation may increase males’ sense Df threat concerning female power, 
with negative repercussions for interaction at work and at home (see Wharton 
& Baron 1987). 


JOB UNCERTAINTIES Other research d-aws attention to uncertainties at 
work—concerning job demands, evaluaticn of one's performance, job secu- 
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rity and occupational mobility. Even among relatively successful upper-level 

male managers, the threat (and experience) of being forced out, demoted, or 
having one's earnings take a sudden downward turn is quite common (Weiss 

1990; see also Newman 1988). Consistent with the argument that such’ 
conditions threaten one's feelings of adequacy, mastery, and worth, Weiss 

finds that husbands in these circumstances seek reassurance from their wives 

of continued belief in their abilities to deal with whatever difficulties arise, 

rather than advice or tactical assistance. To the extent that spouses could 

reduce the emotional threat of job uncertainties, the managers felt better able 

to resolve their problems. 

Pearlin & McCall (1990) describe similar dynamics for both husbands and 
wives. Workers who have suffered workplace assaults on their self-esteem 
and competence fear the loss of their spouse's esteem and admiration, so that 
they are sometimes reluctant to reveal their problems. The workplace origins 
of their moodiness, hostility, or withdrawal are ambiguous or denied. As a 
result, spouses interpret these feelings as either unwarranted or as somehow 
attributable to their own deficiencies. These interpretations trigger reciprocal 
anger or threat. In this way the negative affect originating at work may 
establish itself at home. 


Effects of Employment Status, Economic Loss, and Earnings 


Although the effects of occupational conditions on family interaction are 
relatively understudied, much research has considered the effects of unem- 
ployment and attendant loss of income. Horwitz (1987) has reviewed the 
major empirical studies examining the effects of unemployment and attendant 
economic losses on individual well-being: these studies suggest that the 
negative effects of unemployment are particularly severe when reemployment 
is delayed and when emotional and economic resources are relatively meager. 

Effects on sense of self and emotional stability seem particularly severe for 
men, for whom job loss appears to constitute a severe threat to feelings of 
self-worth. This is expectable given the very strong normative expectation of 
male employment and association between men's worth and their economic 
position (Bernard 1981). The pervasiveness of this norm is vividly illustrated 
in current debates about Afro-American families and the shortage of 
“marriageable” males (see, for example, Wilson 1987), in which all women 
are assumed to be reasonably marriageable, but men are not considered 
marriageable unless they are employed. Having made these assumptions, 
researchers decry the shortage of marriageable black men for black women. 
(This analysis also assumes that marriages occur within but not between 
ethnic groups.) Obviously, the conclusion would vary if unemployed women 
were also declared unmarriageable, or if other characteristics of both men and 
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women were considered in estimates of maarriageability. As it is, however, 
unemployed men are not taken seriously as reasonable marriage partners. 

Given these powerful expectations, it is not surprising that studies of 
men's, particularly married fathers', unemployment document powerful 
negative effects on individual well-being (Kessler et al 1987) and on the 
quality of participation in family life (Elder 1974, Elder et al 1985, 1986). As 
noted above, initial emotional instability of husband-fathers also plays an 
important role in translating employment bss into damaged marital relations 
and explosive father-child interaction, as Elder's studies have documented. 
Siegal (1985) draws attention as well to the likelihood that economic depriva- 
tion negatively affects children's perception of their parents, which in turn 
shapes children's behavior toward adults (.n addition to the more commonly 
discussed impact on parents' behavior toward children). He argues that 
economic deprivation is particularly like y to hurt father-son relations as 
fathers lose stature in the family relative to mothers (see also Elder 1974, 
Ellwood 1988, Voydanoff 1990). Having a man's fears of inadequacy and 
reduced self-worth confirmed by his children is likely to elicit still more 
negative emotional arousal. It is not surprising that fathers without jobs are 
described in the research literature as irritable, hostile, and explosive; they are 
seen as likely to be punitive and arbitrary to their children and irritated by 
increased interaction with them. The increased presence of the father, when 
unemployed, is unexpected, undesired, end nonnormative. Both the men 
themselves and their families see it as excessive father presence under un- 
fortunate circumstances with unhappy corsequences (see McLoyd 1989 and 
McLoyd 1990 for recent comprehensive xeviews). 

In contrast, women's, particularly wives , absence from the labor force has 
been viewed as less problematic. It is women's entry into employment that 
has been studied in terms of potentially negative consequences for women's 
own well-being, the emotional well-being of their husbands, and for marital 
and parent-child relations (see Hoffman 1589). Yet research suggests that for 
women as well as men employment tends to have positive mental health 
effects overall, particularly for breadwinnezs (Menaghan 1989, Wethington & 
Kessler 1989; for useful reviews see Roserfield 1989 and Repetti et al 1989). 
Recent work by Ensminger & Celentano (1990) indicates that gender differ- 
ences in employment effects may be weakening. Among parents with children 
aged sixteen and younger, the negative effects of becoming unemployed on 
distress were similar for married fathers, married mothers, single mothers, 
and single fathers. 


EARNINGS EFFECTS Sex segregation is 2xtensive in the labor market, and 
the jobs that women typically hold have lower average wages than those 
dominated by men. As a result, men and women are unlikely to gain equiv- 
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alent economic rewards from their efforts/hours at work. There are pervasive 
expectations that men will make more money, in different kinds of jobs, than 
women do; and that they will be more committed to work roles than women. 
Yet as Bielby & Bielby (1988) have shown, when men and women's work 
conditions are similar, their commitment to work is also similar. Bielby & 
Bielby suggest that for both men and women, commitment to family is on 
average higher than commitment to work. For both men and women, low 
wages increase family economic pressures, which in turn affect feelings of 
mastery and self-efficacy (Duncan & Liker 1983, Andrisani 1978, Mirowsky 
& Ross 1986, Downey & Moen 1987, Pearlin et al 1981) and spousal and 
parent-child interaction. (Moen et al 1983, Conger et al 1984, Siegal 1984, 
Piotrkowski et al 1987, Voydanoff 1987). 

Catherine Ross and John Mirowsky argue that at least among married 
couples, we are beginning to witness a reorientation of assumptions about 
men's and women's relative responsibilities. Ross & Mirowsky find that 
couples who share both breadwinning and childrearing/homemaking roles 
equally, and who believe in the validity of this arrangement, actually report 
higher psychological well-being than do other married adults (Ross et al 1983, 
Ross & Mirowsky 1988). But such couples are still a small minority. Even 
men who believe that they support their wives' employment tend not to view 
women as equally responsible to be family bread-winners (see Thompson & 
Walker 1989). Changes in wives' behavior have not yet altered these deep- 
seated differences in interpretations and expectations. Married men are un- 
likely to be asked how they handle it all, or to receive suggestions about 
switching to part-time work schedules. Such men are unlikely to be held 
primarily responsible for shortfalls in domestic performance. 

Thus, the effects of low earnings on well-being and self-worth and in- 
directly on family interaction may be somewhat stronger among husbands 
than among wives. Here the interpretation of the situation and the discrepancy 
between norms and actual behavior plays a role. Many contemporary hus- 
bands, no matter how little housework they do, are apt to think (perhaps 
correctly) that they are an improvement over past generations and should get 
some appreciation. Conversely, women, no matter how much they do at 
home, believe that their mothers did more and feel guilty. Similarly, wage- 
earning women even with low wages feel buoyed by the fact of some wages, 
while men with lower than average wages may feel like inadequate men (and 
their wives may agree with their self-assessment, further increasing male 
anxiety). 

Gender-differentiated expectations also influence the effects of married 
men's earnings relative to that of their wives. Husband-wife discrepancies in 
education, occupation, and earnings are expected to favor the husband, and 
the relative resources of men and women within marriages echo rather than 
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contradict the greater resources and pow2r of men in the larger society. 
Although in the last few decades wives’ economic dependency has been 
somewhat reduced (Sorensen & McLanahan 1987), lower wages and fewer 
work hours keep the contribution even of -nost full-time employed wives to 
the family income below 50%. But it is not -he continued economic inferiority 
of wives to husbands that is seen as potentially problematic in current 
research. Rather, the concern is precisely that if wives' earnings begin to 
approach or even exceed the economic cortributions of their husbands, their 
husbands' self-esteem will decline. 

Voydanoff (1987) reviews studies suppo«ting this idea. Although the stud- 
ies do not look directly at earnings, they find that when wives' educational 
attainment exceeds their husbands’, or her cccupational prestige is higher than 
his educational attainment would predict fo- him, husbands report less marital 
satisfaction (Hornung & McCullough 1981). In fact, consistent with the idea 
of high threat leading to a fight or flight response, such mismatches increase 
the likelihood of life-threatening family v.olence (Hornung et al 1981). As 
egalitarian gender norms become more wid*ly accepted, such negative effects 
may diminish and we may frame different research questions, perhaps even 
examining the potential negative effects on wives' self-esteem and well-being 
of having husbands whose earnings exceed their own. 


SUMMARY AND CONCLUSION 


The study of family interaction proceeds cn many levels. While much work 
utilizes self-reports of usual family patterms, there is increasing emphasis on 
observational studies, particularly detailed analyses of observed behavior 
sequences. Each of these approaches has ite disadvantages. Self-report studies 
are hindered by individual reporting inaccuracies and are affected in unknown 
ways by individual interpretations of expe-iences. On the other hand, obser- 
vational studies lack an "insider" perspective on the meanings and history of 
interaction that interacting individuals bring to the situation, and the research 
is affected to an unknown extent by individuals' awareness that they are being 
observed. Recent innovations partially cvercome these disadvantages by 
combining self-report and observational methods, along with the collection of 
physiological measures, to construct a muttifaceted understanding of interac- 
tion. 

The analysis of observed family interaction has focused on the behavior 
contingencies that differentiate happier and less happy marriages, or families 
with more and less aggressive children. Less satisfied couples and families 
with less competent children are alike im their over-reliance on coercive 
behaviors, negative emotional tone, and under-usage of positive, supportive 
behaviors. There is a strong reciprocal pu] to negative behaviors, leading to 
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malignant spirals of coercive or rejecting behavior sequences. Clearly it is no 
easy thing for many families to avoid falling into or persisting in these 
patterns, despite their negative outcomes. Such findings prompt greater atten- 
tion to factors outside the interaction, other than individual personality de- 
ficiencies, that may create a context conducive to unsatisfying and ineffective 
family interaction. ` 

This chapter identifies a common dynamic that may explain observed 
effects of occupational factors on such family climates. Such effects are more 
likely when individuals believe that their sense of safety, esteem, mastery, 
and competence are threatened. While such threats can readily arise in the 
course of family interaction, as studies. of behavior sequences have es- 
tablished, they may also originate outside the family. This chapter has 
reviewed evidence suggesting that a variety of workplace conditions— 
restriction of opportunity to exercise self-direction, work overload, poor 
quality of interpersonal relations on the job, low opportunity for cooperative 
problem-solving, job insecurities, job loss, and low earnings—have emotion- 
al repercussions that have negative implications for family interaction. 

This research has tended to take two alternative directions, which are aimed 
at somewhat different theoretical questions. On the one hand, some research 
has emphasized socially structured differences in requirements among 
occupations that would be expected to produce enduring differences among 
job incumbents in the extent of distress produced at work. Such studies 
illuminate the pervasive effects on family life of one's experiences in the 
economic structure. Unfortunately, it is difficult to disentangle the unique 
effects of jobs from the characteristics of individuals that may select them into 
those jobs, and also into differing marital and family situations. One way 
around the ambiguities of social selection and social causation arguments, and 
the alternative direction taken in some of the most innovative recent studies, is 
to treat each individual as his/her own control, and study variations over time 
in workplace experiences. While such studies are cleaner regarding the 
direction of causation, they hold constant, and leave unexamined, the large 
differences among occupations that individuals hold. Thus, they are unable to 
examine effects of such differences. On the other hand, they are in a stronger 
position to articulate the processes by which workplace distress invades 
family relationships. 

Studies of work-family linkages have tended not to use observational 
methods to investigate either workplace or family stressors, but this is an 
important direction tbat future research should take. Existing data sets with 
single observations of marital or family interaction could be usefully analyzed 
in terms of the occupational characteristics of the participants. To the extent 
that occupations differ in distress-provoking conditions, and emotional dis- 
tress compromises individual availability for constructive interaction, in- 
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dividual responses to specific categories of partner behaviors should be more 
negative than average contingencies would predict. Just as supplementing 
observational data with self-reports of attributions has increased understand- 
ing and prediction of behavioral responses, analyzing extrafamilial influences 
may provide both researchers and participants with a greater awareness of 
occupational context factors that are currently overlooked. 

Such studies consider pervasive and presumably persistent effects of posi- 
tion in the occupational and economic structure. But studies linking fluctua- 
tions in day-to-day work conditions with fluctuations in day-to-day family 
interaction can also take advantage of observational approaches. Indeed, such 
Jinkages would help to establish more adequately how and under what con- 
ditions, for example, arguments at work tend to be followed by arguments 
with one's spouse or children. By systematically and explicitly bringing the 
larger occupational context into the emerging analyses of the microstructure 
of family interaction, such research can illuminate the social sources of many 
of the bewildering obstacles to supportive and satisfying family interaction. 
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Abstract 


Writing in 1965, Lewis Coser observed, “The poor, a stratum recruited from 
hetërogeneous origins, belong to a common category by virtue of an es- 
sentially passive trait, namely that society reacts to them in a particular 
manner” (Coser 1965:142). This review examines the “urban underclass,” the 
latest effort to analyze, categorize, and react to poverty in America. It begins 
with a discussion of the continuing and pervasive appeal of cultural explana- 
tions as the root cause of poverty, perspectives which boast a myriad of 
scholarly proponents from the right and increasingly from the left. Structural 
theories, which by contrast find explanation for poverty in various, sometimes 
conflicting changes in the economy and have an equally loyal following, are 
also examined. A third, ethnographic approach with antecedents in the prior 
work of Liebow, Howell, and Stack is explored. This emerging perspective 
both embraces and eschews different elements of the other two, attempting to 
ignore labels and understand instead “how people in real communities devise 
collective responses to their problems” (Sullivan 1989:50). The review sug- 
gests that nascent attempts are now in evidence from all sides to produce a 
common ground—part culture, part structure. But it also concludes that the 
real advance will come about when theorizing expands beyond the limited 
arena of the culture versus structure debate. 


INTRODUCTION 


Scholars have always studied and written about poverty. But every few 
decades such work is singled out for public attention and scrutiny (Wilson & 
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Aponte 1985:233). It is then that the audience expands, the information is 
widely disseminated, and meaning becomes mired in long-standing disputes 
over issues of classification and obligation. “The vocabulary of poverty,” 
Michael Katz observed, "impoverishes political imagination" (Katz 1989:3). 
The purpose of this survey is to examine the current translations of "the poor" 
and to underscore some of the contradictions that result. 

The study of the underclass has brought to the surface both familiar 
arguments and a host of new disagreements. Definitions vary over issues of 
race and ethnicity (Moore 1989, Tienda 1989, Gomes & Fishman 1989), 
gender (Rossi & Wright 1989), employment (Testa et a] 1989, Mead 1989), 
and family structure (Bane 1986, McLanahan & Garfinkel 1989). Ken Auletta 
describes the class as consisting of four distinct elements: "hostile street and 
career criminals, skilled entrepreneurs of the underground economy, passive 
victims of government support and the severely traumatized" (Auletta 
1982:49). Douglas Glasgow adds gender to this list suggesting that a dis- 
proportionately large number are young men "ranging in age from 14 through 
27" (Glasgow 1981:9). Nicholas Lemann provides a less specific interpreta- 
tion identifying the main characteristics “as poverty, crime, poor education, 
dependency, and teenage out-of-wedlock childbearing" (Lemann 1986:41). 
Jencks is more to the point, noting "by common consensus [the underclass] 
includes only the undeserving poor" (Jencks 1988:23). And, from a different 
perspective, Nathan suggests, “There has been a distinctive structural change 
in social conditions in the United States over the past two decades that is 
expressed by the term underclass" (Nathan 1987:57; see also Hochschild 
1989:144). Wilson describes the class as "individuals who lack training and 
skills and either experience long-term unemployment or are not part of the 
labor force" (Wilson 1985:546, see also Ricketts & Sawhill 1986). 

Yet, even in the absence of agreement on content, the underclass label has 
rapidly taken on a meaning of its own in the public debate. A "vision of 
poverty,” William J. Wilson has written, “has found its most vivid expression 
in the lurid descriptions of ghetto residents that have flourished in the pages of 
popular magazines and on televised programs devoted to the emerging under- 
class" (Wilson 1989:9). Almost overnight politicians, journalists, and social 
Scientists rushed to document its terrifying presence in every major ur- 
ban center. The term's acceptance is so wide, in fact, that the thematic 
contradictions which lie just below the surface are seldom noted. That 
the class exists is generally not disputed (though the term itself is often ma- 
ligned), but on few other issues is there anything like common agree- 
ment. 

The early formulations articulated a growing concern expressed about 
people who were hungry and homeless and living in "impossible conditions.” 
A cover story in Time Magazine in August 1977, described the underclass as 
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“people who are more intractable, more socially alien and more hostile than 
almost anyone had imagined." They "seem to be stuck more or less per- 
manently at the bottom. Though its members come from all races and live in 
many places," Time reported, “the underclass is made up mostly of im- 
poverished urban blacks who still suffer from the heritage of slavery and 
discrimination" (Time Magazine 1977:18). The formulation was simple: 
hopeless people, mostly black, stuck at the bottom due to historic discrimina- 
tion. 

The first correction came in 1978, when Wilson published his controversial 
work on the declining significance of race. In it he argued, “the ultimate basis 
for current racial tension is the deleterious effect of basic structural changes in 
the modern American economy on black and white lower-income groups, 
changes that include uneven economic growth, increasing technology, and 
automation, industry relocation, and labor market segmentation" (Wilson 
1978:154). For Wilson, the underclass represented "a massive population at 
the very bottom of the social ladder plagued by poor education and low- 
paying jobs" (Wilson 1978:1) The correction is noticeable. Blacks are still 
part, but whites are also part, of a "massive population" whose origins are 
found not in historic discrimination but in a changing economy. 

The concern was readily transformed into a rash of articles devoted to 
descriptions, cures, and eventually to issues of blame. The Chicago Tribune, 
for example, carried a piece early in the 1980s about ghetto unemployment 
which said that blacks from the. South at first "held unskilled jobs involving 
lifting or shoveling or carrying, in factories, in steel mills . . . the jobs paid 
badly but were better than nothing. . . . It was precisely these jobs that 
disappeared fastest" (quoted in Gray 1989:23). This explanation was con- 
sistent with Wilson's notion of a declining economic base and was reflected in 
his later concept of a male-marriageability pool index (MMPI). But the 
Tribune went further, observing “the reasons for holding a low paying 
job have largely been eroded in the ghetto . . ; it is much simpler than a de- 
cline in the work ethic . . . both the status of holding a job and the need for 
money to live on have disappeared." Having first explained that there were 
no jobs, the Tribune then insisted that if there were jobs, they would re- 
main vacant. 

This became part of a more serious correction in the eighties, an attempt to 
downplay problems in the economy and to introduce an explanation of what 
one journalist called a “ferocious subculture” (Hamill 1988, Orfield 1988). 
Understanding poverty by culture was not a new strategy but rather one 
dredged up from scholarly annals and discarded by many academics the last 
time poverty was popular. The new discussions often noted the historical 
debate, sometimes vindicating Moynihan (Wilson calls attacks-on Moynihan 
“vitriolic” (Wilson 1987:21), sometimes reminding others of his general 
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repudiation (“the archetypal sexism on which [the report] rests is inextricable 
from its racism,” writes Jewell Handy Gresham in The Nation in 1989). 

Throughout the eighties, the two paradigms, culture on the one hand, 
structure on the other, competed for intellectual ascendancy. In their pristine 
state, they are very different. Cultural determinist theories assume that in- 
dividuals are "locked" into poverty by a "unique but maladaptive culture" 
(Baca Zinn 1989:857, Wilson & Aponte 1985:249, Gray 1989:2). Reflecting 
Fitzgerald's comment on the rich, the poor are viewed as different. As George 
Gilder in his 1981 Wealth and Poverty explained, "The problem is neither 
race nor matriarchy in any meaningful sense. It is familial anarchy among the 
concentrated poor of the inner city, in which flamboyant and impulsive youths 
rather than responsible men provide the themes of aspiration" (Gilder 
1981:153). For Nicholas Lemann, these themes produced "a way of life 
utterly different from that in the American mainstream" (Lemann 1986:32). 

Structural explanations blame faulty economic organization, often lament- 
ing, in the words of Michael Katz, that "poverty slipped easily, unreflectively 
into a language of family, race and culture rather than inequality, power and 
exploitation" (Katz 1989:8). As Ray Marshall, Secretary of Labor under 
President Carter, observed, “We know that a full-employment policy would 
be good business. The main reason we don't do it now is the neoconservative 
mythology that it didn't work. We ought not to let them get by with that" 
(quoted in Burnham 1989:124). For structuralists, the task was to reassert 
their control of the policy agenda. 

Over time, the ideological orthodoxy was sacrificed as researchers, weary 
of a controversy that had no easy resolution, appeared to collapse the oppos- 
ing sides with increasing frequency (Gans 1968:208). Auletta, whose 1982 
book on the underclass created a media sensation, provides the synthesis at 
which many had arrived. "Pushing aside the pieties, charts, and stereotypes," 
Auletta writes, “one sees that a segment of the poor are sometimes victims of 
their own bad attitudes, and sometimes victims of social and economic forces. 
Neither the right's desire to blame individuals, nor the left's to blame the 
system, addresses the stubborn reality of the underclass. Somewhere between 
the stale shibboleths lies the truth" (Auletta 1982:319). However, as Wilson 
& Aponte (1985) conclude, "There is as yet no definitive research to de- 
termine precisely [their] relative significance" (p. 249). 

At decade's end, the disordered discussion spawned a third, ethnographic 
perspective which at once embraced and abandoned major elements of each 
(Katz 1989:172). The exotic poor of the cultural determinists and the victim- 
ized poor of the structuralists wed in a harsh ethnographic reality because, in 
the words of Mercer Sullivan, "structural and individual frameworks beg 
crucial questions of how people in real communities devise collective re- 
sponses to their problems" (Sullivan 1989:50). 
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Cultural Determinists 


The cultural model, the model of choice of media analysts, boasts a myriad of 
scholarly proponents from the right and increasingly from the left. Its adher- 
ents generally conclude that individuals become poor by a set of wrong 
attitudes and, ultimately, wrong choices. Most agree that the origins of the 
problem are found in the breakdown of major institutions, the family, school, 
and church. The highlighted institution in decline is the family. Yet, lawyer, 
former civil rights activist, and current head of the Children's Defense Fund, 
Marian Wright Edelman's claim that “the poor are victims of growing up in 
fatherless homes, of not being taught the value of an education," has a very 
different starting point from that of Charles Murray who in Losing Ground 
asserted, “Facing no credible sanctions (i.e. inschool disciplinary measures) 
for not learning and possessing no tangible incentives to learn, large numbers 
of students did things they considered more fun and did not learn" (Murray 
1984:174). In an peculiar twist, Wilson marks out another path in his latest 
work, The Truly Disadvantaged. While rejecting a culture of poverty thesis, 
he nonetheless emphasizes "the sharp increase in social pathologies in ghetto 
communities," realities he thinks "liberals" ignore or refuse to accept (Wilson 
1987:10). These problems include “black crime, teenage pregnancy, female- 
headed families, and welfare dependency" and are, according to Wilson, 
outcomes of both culture and economic organization (Wilson 1987:21—22). 
Cultural maladies have structural origins, Wilson argues. 

For the most conservative cultural determinists, the disorganized and 
pathological family of old has been transformed into a unit made diseased by 
Society's good intentions. As Charles Murray illustrates: 


We have in effect changed the rules of the game for poor people. Essentially we said: "It's 
not your fault. If you are not learning in school, it is because the educational system is 
biased, if you are committing crimes, it is because the environment is poor, if you have a 
baby that you can't care for, it's because your own upbringing was bad." Having absolved 
everybody of responsibility, we then said: “You can get along if you have a baby but have 
no husband and no income. You can survive without participating in society the way your 
parents had to." And lots of young people took the bait. (Murray 1986:36) 


And like that earlier version, it is viewed as "the fundamental source of 
weakness in the community" (Moynihan 1965:5). At the same time, pro- 
ponents of the cultural model, in an apparent change of heart, appear to 
lament the passing of the once pathological family of the 1960s because, we 
are led to believe, its disappearance has produced in its stead, the present 
"aberration." So in the 1960s, the great fear was that parents were passing 
inappropriate norms and values onto their children, thereby programming 
them for failure. In the 1980s, the fear is that a "vanishing family", to use the 
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title of Bill Moyers television documentary, is not available to instill values of 
any kind into their offspring. 

Growing up outside the family, they conclude, produces a population 
unschooled in the most basic of lessons, a sense of right and wrong. As Glen 
Loury explains, “What is clear from recent journalistic accounts, as well as 
from more scholarly, epigraphic and sociological literature, is that the be- 
haviors and values of people living in these communities constitute an impor- 
tant cause of their long-term dependency. I'm referring to lack of parenting, 
participation in criminal activities, use of prime drugs and detachment from 
the labor force" (quoted in Lemann 1988:84). Wilson strikes a similar note 
writing of "the disproportionate involvement of blacks in violent crime. More 
than half of those arrested in cities for murders and nonnegligent manslaugh- 
ter, more than half of those arrested for forcible rape, and 64 percent of those 
arrested for robbery are black," he argues (Wilson 1987:22). 

The underclass is, in fact, often depicted as living by a code of their own, 
not so much with a culture of poverty as with a culture of the jungle. President 
Reagan, in barely disguised language before a delegation of Chiefs of Police, 
commented, “Only our deep moral values and strong institutions can hold 
back that jungle and restrain the darker impulses of human nature" (quoted in 
Burnham 1989:136). 

The welfare system is seen in this analysis as the major source for the 
perpetuation of the dangerous class. Murray finds that the plight of the poor 
began to deteriorate in the late 1960s as the Great Society programs grew. In 
particular, Murray documents that from 1965 to 1970 unemployment among 
black youths rose, an indication that programs harmed rather than helped this 
population. He discounts alternative explanations of job loss resulting from 
agricultural mechanization in the South, automation, and competition “due to 
the huge influx of women and baby boomers" into the job market in the 
North. His conclusion is rather that job loss, high rates of illegitimacy, and 
welfare dependency “were results that could have been predicted from the 
changes that social policy made in the rewards and penalties, carrots and 
sticks, that govern human behavior" (Murray 1986:25, 65, 155). 

Depicting the system as an overgenerous parent spoiling an unruly child, 
Gilder suggests, “The fundamental fact in the lives of the poor in most parts of 
America today is that the wages of common labor are far below the benefits of 
AFDC, Medicaid, food stamps, public housing, public defenders, leisure 
time, and all the other goods and services of the welfare state” (Gilder 
1981:122). 

Those who see evil in the system invent a simple remedy. Murray’s 
solution to the welfare crisis was: 


Scrapping the entire federal welfare and income support structure for working aged 
persons, including AFDC, Medicaid, Food Stamps, Unemployment Insurance, Worker’s 
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Compensation, subsidized housing, disability insurance and the rest. It would leave the 
working-aged persons with no recourse whatsoever except the job market, family mem- 
bers, friends and public or private locally funded services. (Murray 1986:227) 


But conservatives are not alone in blaming the welfare system. Eleanor 
Holmes Norton, former head of the Equal Employment Opportunity Commis- 
sion, concludes: "The welfare program—a brilliant New Deal invention now 
stretched to respond to a range of problems never envisioned for it—often 
deepens dependence and lowers self-esteem" (quoted in Royal 1985:35) 
Nathan argues, "There is a concept in economics that is helpful for this 
analysis; signaling. We may not be doing people a favor if we transmit signals 
about welfare rights and entitlements in a society that has a deep and strong 
Calvinist tradition that practically deifies the work ethic" (Nathan 1987:61). 

Linking the origins of the underclass to welfare was the most common 
explanation but not the only one. Some choose to locate the roots of the black 
underclass in the rural South. The argument is not new. In the 1950s, Duncan 
& Hauser argued that the influx of rural workers, unaccustomed to city life, 
had strained to the breaking point many urban communities (Duncan & 
Hauser 1960). In the country, Hauser reasons, an empty beer can may be 
disposed of by throwing it into the open fields. However, in the city throwing 
the beer can out onto the street causes environmental as well as health 
hazards. This "beer can in the cotton patch thesis," as Silberman labeled it, 
blames a changed habitat and a slowness of people to adapt to the new setting 
as the root cause of poverty and misery (Silberman 1964:37). 

In a widely read article published in The Atlantic, Lemann picks up a 
similar theme declaring that the origins of the black underclass "can be traced 
to the sharecropper South of a generation ago. Every aspect of the underclass 
culture in the ghettos," Lemann argues, "is directly traceable to roots in the 
South—and not the South of slavery but the South of a generation ago" 
(Lemann 1986:35). In succeeding paragraphs, he asserts, "it appears that the 
distinctive culture is now the greatest barrier to progress by the black under- 
class, rather than either unemployment or welfare" (Lemann 1986:36). 

Yet, for the cultural theorists of both the right and the left, the underclass 
does not represent a massive group at the bottom but rather a highly disruptive 
band of misfits. The most prominent features of the group are no longer 
joblessness and poor education but welfare dependency, out-of-wedlock preg- 
nancy, and petty crime. And though again unstated, one has a suspicion that 
they invent a social class all of whom are under the age of twenty-five, and 
“not participating in society the way your parents had to,” one explained. 

The refusal of the poor to work is a subject that gets increasing attention. 
Speaking of employment, Lawrence Mead writes, "The signs are that unem- 
ployment has become more voluntary, in the sense that jobseekers often 
choose to remain jobless rather than take jobs most accessible to them. The 
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problem is not that jobs are unavailable but -hat they are frequently unaccept- 
able, in pay or conditions, given that some income is usually available from 
families or benefit programs" (Mead 1985:70). Two things are assumed 
without comment: first, is the notion that there are accessible jobs, an idea 
which runs counter to the previous assertien that "even bad jobs have dis- 
appeared” (see p. 4), and second, that welfare checks are so generous they are 
sufficient to support family members typically not included in AFDC house- 
hold budgets. 

Auletta goes even further, linking the refusal to work to a propensity for 
crime, “What is indisputable,” he writes, “is that violence is more common 
among the underclass. Some see crime as z profitable or exciting career and 
would engage in it even if offered a good jcb" (Auletta 1982:47). Hence, the 
volunteerism of unemployment is joined to znd explained by the love of crime 
and violence. 

Though proponents of the cultural perspective tread lightly on the issue, 
race is central to the concept of the class formation. The underclass has 
become, in the words of Nicholas Lemann, “mainly a black problem” 
(Lemann 1988:81). Again Lemann writes, “If the underclass is defined as a 
group prone to out-of-wedlock childbirth, crime and low educational achieve- 
ment, there has been an identifiable black underclass for all of this century” 
(Lemann 1988:82). 

Race without racism is an important asoect of the explanation. For ex- 
ample, Virginia Senator and moderate Democrat Chuck Robb speaks for 
many when he suggests that "the barriers blocking black Americans from 
entering the mainstream have largely been -emoved. Now is the time for the 
social agenda to shift the primary focus from racism to the self-defeating 
patterns of behavior of a mostly black urben underclass" (quoted in Jackson 
1988:89). And in a more accusatory tone, Lemann asks, “Why have millions 
of all races found enough economic opportunity to permit an escape from the 
same inner cities where millions of others seem trapped forever?” (Lemann 
1986:61). 

It is not, however, discrimination that nas produced this outcome. The 
proof is found in the counter experiences o7 successful blacks. “We live in a 
time," writes Pete Hamill in Esquire, “when the old barriers to blacks have 
fallen, when the doors of the establishment have at least partially opened and 
yet we are seeing that too many blacks can’t even walk in the door." Time 
Magazine writes, on another occasion, “one of America's success sagas has 
been the rise of many blacks to the secure middleclass." And Lemann adds 
there are “two versions of black life since the sixties . . . in which blacks are 
splitting into a middle class and an underclass that seems likely never to make 
it” (Lemann 1986:33). 

The implication is clear: The black middle class has become a “model 
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minority" while the black underclass is culturally impaired. Wilson is most 
often cited in support of this view because he suggests that the removal of the 
black middle class from the inner city did much to undermine the presence of 
role models who reinforce proper values (Wilson 1987:49). It is a theme 
readily embellished by others and, according to Gans, "accepted as dogma" 
before it has been "shown to be valid" (Gans 1990:275). Lemann writes, for 
example: 


For most of this time, nearly all blacks had to live in traditional ghettoes, because of the 
strict residential segregation that prevailed nearly everywhere in the urban North. This 
meant that the black middle and working classes controlled the ghettos and set the tone 
there. But in the late '60's, when opportunities of all kinds suddenly opened up for blacks, 
the ministers and postal workers and schoolteachers began leaving the ghettos and moving 
to formerly white working-class neighborhoods. (Lemann 1986:82, see also Nathan 
1989:17) 


Still, there remains the problem of explaining how all whites escaped the 
underclass label; a particularly thorny issue since "roughly 70 percent of all 
the poor in America are white" (Harrington 1987:7). It is resolved by an 
academic sleight of hand (i.e. “Given the nature of the case, it is impossible to 
prove in a scientific fashion"—Murray 1984:35). Auletta explains the anoma- 
ly by maintaining that while "all races behave the same, in the rural South 
(where many of the white poor reside) defiant anti-social behavior is less 
prevalent; the underclass is less visible. In urban America, the members of the 
underclass are as omnipresent as potholes, and can be considerably more 
dangerous" (Auletta 1982:200). 

The addition of urban residence to the list of characteristics of the under- 
class legitimizes the pervasive anti-urban bias of the American public. In 
1962, Harrington made a point of suggesting that the “other America" was not 
"exclusively city dwellers" nor were they "exclusively black" (Harrington 
1962:43). But the new underclass, according to proponents, is very different 
from the traditional poor. Thus, Auletta writes, "the nation faces few prob- 
lems as formidable as the presence of a group of people, largely concentrated 
in its principal cities, who live at the margin of society" (Auletta 1982:24). 

Isolation, as opposed to invisibility, is seen as a significant part of the 
explanation for the development of the underclass. So it is not race per se, 
urban residence per se, nor even poverty that defines it, but the three in 
combination. Lemann asks, for example, “If, from the late sixties through the 
early eighties, the black urban lower class became significantly more isolated 
than it ever had before, wouldn't that help explain what happened?" (Lemann 
1986:51). And Nathan observes, “My view of the situation is that the under- 
class in a distinctively urban condition involving a hardened residual group 
that is difficult to reach and relate to" (Nathan 1987). 
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Cultural theorists conclude that the "real" underclass is, in the words of 
Auletta "relatively few in number" (Auletta 1982:25). This point is echoed by 
Nathan who declares, "it must always be added that we are talking about a 
relatively small subgroup of the poor" (Nathan 1987:58). Writing in 1987, M. 
Magnet in Fortune Magazine asked, “Who are the Underclass?” and re- 
sponded, 


They are poor; but numbering around five million. They are a relatively small minority of 
the 33 million Americans with incomes below the official poverty line. Disproportionately 
black and Hispanic, they are still a minority within these minorities. What primarily defines 
them is not so much their poverty or race as their behavior—their chronic lawlessness, drug 
use, out-of-wedlock births, nonwork, welfare dependency and school failure. (May 
17:139) 


How the final figure, five million, is arrived at is not explained. Is it two 
million welfare dependent women, their children and drug-addicted boy- 
friends? And is their criminality the major ingredient of their underclass 
status? Even without definition or documentation, the number is nonetheless 
readily repeated. In fact Hamill further reduces the population to "blacks 
{alone} trapped in cycles of welfare dependency, drugs, alcohol, crime, 
illiteracy, and disease, living in anarchy and isolation in some of the richest 
cities on earth" (Esquire March: 1988). New to the last formulation are the 
categories of disease and anarchy. In a ten-year period, the underclass has 
been transformed from surplus and discarded labor into an exclusive group of 
black urban terrorists. 

And what is to be done with this group? At least one cultural determinist 
proposes, with structural logic, government programs. Writing in the Wash- 
ington Post in September, 1990, Lemann advised creating for his sharecrop- 
per populations "programs to make the streets safe again, to improve inner- 
city education, to train people for jobs and then find them jobs for which 
they've been trained" (quoted in the Wilmington News Journal Sept. 2, 1990). 


Structuralists 


While the cultural perspective emphasizes individual lifestyles, the structural- 
ists examine fundamental weaknesses in the economy, making points they 
have made without interruption and to increasingly deaf ears since the late 
1950s. Crumbling families, bad attitudes, and welfare dependency are seen 
by many structuralists, if at all, as outcomes of faulty economic organization. 
So while the cultural theorists give the public exotic scripts with dead bodies, 
crack houses, and neglected babies, the structuralists offer aggregate data. In 
much of their work, the focus of attention is the working poor. The "truly 
disadvantaged," with one notable exception, are not distinguished from the 
rest. 
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This is intentional. In understanding the underclass, structuralists point to a 
“new industrial order" characterized by a significant shift from manufacturing 
to service sector employment (Kasarda 1989:28—33). Baca Zinn suggests, 
“The decline of manufacturing jobs has altered the most important role once 
played by the cities as opportunity ladders for the disadvantaged. Since the 
start of World War II, well-paying blue collar jobs in manufacturing have 
been a main avenue of job security and mobility for Blacks and Chicanos. 
Movement into higher-level blue collar jobs was one of the most important 
components of Black occupational advancement in the 1970s" (Baca Zinn 
1989:866). 

The structuralists contend that the new economic order has produced 
massive vulnerability among all workers (Bluestone & Harrison 1986, Axinn 
& Stern 1988:23—4). As one Census study revealed, in 1986, it was “es- 
timated that over five million workers had lost their jobs since 1981 after 
having been employed for over three years. Only 2596 found jobs that paid as 
well as the ones they lost, 34% were either unemployed or out of the labor 
force and the rest worked at lower paying jobs" (quoted in Axinn & Stern, 
1988:22). Put simply, structuralists argue strongly that America did not need 
as.many workers as it once did. 

What does this mean for the poor? For the structuralists, the problem of 
poverty is basically a problem of lack of income-producing employment. In 
1985, 33 million Americans were estimated to be living under the poverty 
line. Many of these, it is argued, worked at full-time jobs. Over 12 million 
people are poor not because they do not work, but because they work full-time 
at the minimum wage, which is now pegged at only 60% of the poverty line 
for a family of four. Structuralists vary, however, over explanations of the 
source of the problem. The segmented labor market approach of Harrison, 
Gordon, and others tends to mirror prior exploitation theories that suggest 
"when working, [minority workers] tend to be concentrated in jobs that are 
insecure, dirty, unskilled and at the bottom of the hierarchy of authority where 
there is little possibility for advancement" (Blauner 1972:23). These theorists 
argue that "good" jobs are disappearing. In their stead have been created jobs, 
“a larger share of which are either low-wage or part-time" or both. According 
to Axinn & Stern, two economists at the University of Pennsylvania, “almost 
a third of all new jobs created since 1980" may be so described. Of course, 
characteristic of this employment is the fact that health and pension benefits, 
disability and unemployment insurance are not paid. 

"The acceptance of such theories has been undercut by claims of the great 
American job-creation success story. As Kasarda revealed, “during the last 20 
years the U.S. economy has added nearly 40 million jobs” (Kasarda 1990:1). 
Moreover, it is claimed “the decline in blue-collar employment in the central 
city has been partly offset by expansion in ‘knowledge-intensive’ fields such 
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as advertising, finance, brokering, consulting, accounting, and law" (Wilson 
1987:102). What has resulted, these authors contend, “is a serious mismatch 
between the skills of inner-city blacks and the opportunities available to them" 
(Wilson 1987:102). According to Kasarda, the mismatch is one explanation 
of “why black unemployment rates have not responded well to economic 
recovery in many northern cities" (quoted in Wilson 1987:102). 

However, the ability of unskilled immigrants to find employment in the 
midst of native worker idleness is an argument used to counter the “mis- 
match" thesis. Critics submit that finding work does not require advanced 
training but rather a willingness to accept low pay and poor conditions. For 
this circumstance, a third set of explanations surfaced among structuralists. 
Sassen found, for example, “as industrial production has moved overseas, the 
traditional U.S. manufacturing base has eroded and been partly replaced by a 
downgraded manufacturing sector, which is characterized by a growing sup- 
ply of poorly paid, semi-skilled or unskilled production jobs" (Sassen 
1988:825). Immigrants, particularly undocumented ones, "are more likely 
than U.S. citizens to gravitate toward these jobs [which] are poorly paid, offer 
little employment security, generally require few skills and little knowledge 
of English" (Sassen 1988:827). Ironically, domestic minorities are often 
excluded from this employient by a combination of government protection 
for citizen workers and higher expectations. As Light & Bonacich illustrate in 
their "immigrant entrepreneurship" hypothesis, immigrant enterprises profit 
from a supply of "cheap labor [which] enables capital to circumvent local 
high-priced labor, while implicitly threatening those who remain employed 
with displacement if they become too militant or costly" (Light & Bonacich 
1989:331). It does so, they propose, because immigrant small business 
utilization of cheap labor is indirect and not “well covered by the protection 
U.S. labor forged" for direct employment (Light & Bonacich 331-33). The 
authors conclude that while "ethnic entrepreneurship is often . . . presumed to 
enable disadvantaged immigrants to escape from poverty," the long hours, 
hard work, and high risks of such businesses make the results problematic. 
For the "vast majority, the dream will never become a reality" (Light & 
Bonacich 1988:431—32). 

All agree that the problem of joblessness, regardless of source, is com- 
pounded with severe dislocation in the environment. “The social structure of 
today’s inner city,” write Wacquant & Wilson, “has been radically altered by 
the mass exodus of jobs and working families and by the rapid deterioration of 
housing, schools, businesses, recreational facilities, and other community 
organizations, further exacerbated by government policies of industrial and 
urban laissez-faire that have channeled a disproportionate share of federal, 
state, and municipal resources to the more affluent” (1989:10). The 
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authors clearly suggest that the system is intimately involved in the production 
of the underclass world. 

Particular concern is expressed about a lack of public discussion of and 
attention to black youth unemployment. "The culture of poverty hypothesis 
would make more sense," Ellwood & Summers explain, "if black youth 
unemployment were concentrated in the ghettos" (Elwood & Summers 
1986:101). However, the authors found that the range of employment for 
black youth did not vary much between central cities, suburbs, and rural areas 
(3296, 3896, 3596 respectively). It was, in each case, considerably "lower than 
the 62% figure for whites" (Ellwood & Summers 1986:101). “Black income,” 
Sidel notes on another occasion, “is more related to the availability of job 
opportunities for blacks than to levels of education—in other words, blacks 
are being widely discriminated against in the workplace" (Sidel 1986:23). 

And, the structuralists suggest, rather than accepting a view of a widening 
gap between the poor and the "normal population," it must be acknowledged 
that for many of the latter there is a narrowing split. So, for example, Axinn & 
Stern point out, 


The "middle class" mother of three who leaves her husband is likely to be one of the poor, 
as is the blue collar worker whose factory closes down and leaves him without job 
prospects, the young worker who is disabled in an accident and cannot find work, and tbe 
elderly woman who survives her husband and then must live on one-half of his Social 
Security check (Axinn & Stern 1988:6). 


A workforce once hampered by substitutability is now burdened with an 
even greater problem of redundancy. And, they suggest, the black population 
having borne so much of the brunt of this, may now sit back and observe the 
spread to other groups. Thus, "between 1973 and 1984, adult poverty swelled 
by 2.3 million white men, 2.7 million white women, 517,000 black-males and 
769,000 black females" (Axinn & Stern, 1988:102). This meant an increase 
of 39% for white males, 23% for white females, 8% for black males and 2% 
for black females. The authors conclude, “By 1983 among those able to 
secure work for the full year there was a clear convergence between blacks 
and whites. Still the convergence was caused as much by the declining 
economic status of whites as by the improved circumstances of blacks" 
(Axinn & Stern 1988:113). 

Even well-established patterns like the high rate of black teen births are 
presented with qualification (Neckerman et al 1988:398). Edelman points out 
that “while black teen births are dramatically higher than white rates, 67% of 
teen births in America are to whites" (Edelman 1987:27). Moreover, in the 
last few years teen births have fallen in the black population and risen 
significantly among the white population. Indeed, “75% of the children who 
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fell into poverty between 1979 and 1982 were white" (Edelman 1987:250). A 
point, she concludes, “too often forgotten in our rush to explain the anomalies 
of poverty simply in terms of black urban culture." 

Structuralists counter the "unprecedented prosperity" arguments of the 
cultural determinists with documentation of the political changes that oc- 
curred in the Reagan years which dramatically "cut means-tested programs 
benefitting the poor" (Slessarev 1988:357, Skocpol 1988:294, Palmer & 
Sawhill 1984). The 1981 Reagan budget, for example, produced a $10 billion 
cut in subsidized housing, "far in excess of cuts in any other program" 
(Slessarev 1988:359). Significant reductions were also made in Medicaid, 
"community and migrant health centers, child immunization, family planning 
and mental health." Millions of welfare families also faced severe household 
budget cutbacks or removal from rolls altogether (Palmer & Sawhill 1984). 

Structuralists also produce a very different interpretation of the welfare 
state (Weir et al 1988, Burtless 1986). For example, in examining the 
question of "government welfare program blame for exacerbating single- 
parent households," Ellwood & Summers found for blacks, in the period 
between 1972 and 1980, that while "the number of all children in single- 
family households rose 20 percent, the number of black children on AFDC 
actually fell by 5 percent" (Ellwood & Summers 1986:93-94). The authors 
conclude, “Knowingly, or unknowingly, we have been engaged in an experi- 
ment over the past ten years. We have cut back AFDC benefits considerably. 
There has been no noticeable effect on family structure or work” (p. 97). 

Analyzing data from the University of Michigan Panel Study of Income 
Dynamics on “AFDC receipt within and across generations,” Duncan et al 
found “that the stereotype of heavy welfare dependence being routinely 
passed from mother to child” was contradicted by their data (Duncan et al 
1988:469). The authors concluded that “a surprisingly large proportion of 
welfare experiences are only shortlived, that the typical total length of welfare 
receipt is less than 4 years, and that most children growing up in heavily 
dependent homes do not become heavily dependent adults” (Duncan et al 
1988:420, see also Bane 1988). The welfare system does not produce wide- 
spread dependency, they argue. 

The Michigan data have also been used to test the link between motivation 
and poverty. “This research found no evidence that highly motivated people 
are more successful at escaping from poverty than those with lower scores on 
tests” (quoted in Baca-Zinn 1989:861, Levitan & Johnson 1984:106). 

The so-called generosity of the welfare state is also a favorite theme of the 
structuralists (Burtless 1989:27). They are fond of pointing out that Aid to 
Families of Dependent Children, the pariah of many welfare reformers, for 
example, pays benefit levels at 50% of the poverty line in only 19 of the 50 
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states. Further, only 55 out of every 100 eligible poor persons in this country 
receive AFDC benefits of any kind (Edelman 1989:41—42). 

On the subject of culture itself they reserve their strongest invective. Over 
20 years ago the sociologist Lewis Coser observed of the culturalists in the 
culture of poverty debate, "whether they intend it or not, they create the 
impression that the members of that culture are somehow alien beings who 
cannot be assimilated and hence must remain beyond the pale (Coser 
1965:140). Writing in 1989, Wacquant & Wilson noted, “Severed from the 
struggles and structural changes in the society, economy, and polity that in 
fact determine them, inner-city dislocations are then portrayed as a self- 
imposed, self-sustaining phenomenon" (Wacquant & Wilson 1989:9). Troy 
Duster described it as, “a class of persons permanently consigned to a status 
*from without' all the while inside its borders" (1988:7). 

Still structuralists suffer more than cultural determinists from the problem 
of disagreement within their own ranks. In 1987, Wilson published The Truly 
Disadvantaged, the single most comprehensive discussion of the underclass 
to date. Where previously Wilson had specified massive numbers of both 
races at the bottom, in 1987, he notes that, "The social problems of urban life 
in the United States are, in large measure, the problems of racial inequality. 
The rates of crime, drug addiction, out of wedlock births, female-headed 
families, and welfare dependency have risen dramatically in the last several 
years, and they reflect a noticeably uneven distribution by race" (Wilson 
1987:20). Further, he begins his analysis with an attack on the "liberal 
perspective" which, he suggests, is "reluctant to even acknowledge the sharp 
increase in social pathologies in ghetto communities" (Wilson 1987:6). 
Though he is adamant that his is not a “culture of poverty" perspective, many 
find the distinction puzzling. 

Wilson's male marriageability pool index has exposed another disagree- 
ment. While many praise his attempt to highlight the problem of male 
unemployment in the ghetto (see for example Billingsley 1989, Dill 1989, 
Sidel 1986), both Billingsley and Dill suggest that his decided focus on male 
provider roles gives "short shift" to the plight of poor women (Billingsley 
1989:29, Dill 1989:73). Billingsley quotes a Robert Hill finding that, “Black 
women are poor not so much because they do not have husbands as because 
they do not have good jobs" (Billingley 1989:29). As Baca-Zinn observes, 
"Instead of exploring how women's opportunities and earning capacities 
outside of marriage are affected by macrostructural economic transforma- 
tions, instead of calling for pay equity, universal day care and other initiatives 
to buttress women's capacities for living independently in the world . . . 
Wilson goes in exactly the opposite direction" (Baca Zinn 1989:872—73, Reed 
1988:168). 
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Structuralists also continually debate the use of the term itself (Gomes & 
Fishman 1989:86, Newby 1989:126). Richard McGahey writing in the New 
York Times stresses a common theme among a number of structuralists. “The 
underclass," he writes "is a destructive and misleading label that lumps 
together different people who have different problems. And that is the latest 
of a series of popular labels (such as the ‘lumpen proletariat,’ ‘undeserving 
poor,' and the 'culture of poverty") that focuses on individual characteristics 
and thereby stigmatizes the poor for their poverty" (quoted in Wilson 1987:6). 
Suggested Gans, "the increasingly pejorative term seems to be becoming the 
newest buzzword for the undeserving poor." Hence, concluded Gans, "the 
term has taken on so many connotations of . . . blameworthiness that it has 
become hopelessly polluted in meaning, ideological overtone and im- 
plications and should be dropped" (Gans 1990:271)—a point with which 
Wilson himself seems to agree as he called for the abandonment of the use of 
the term in favor of "ghetto poor" at his 1990 ASA Presidential address in 
Washington. 

Changing the term would represent a decided advance in the discussion if 
coupled with an agreement that the persistently poor are not different from the 
rest of the poor. As Billingsley illustrates: 


If one asks how is underclass different from poverty, the answer is that it includes poverty. 
If it is asked how does it differ from unemployment, the answer is that it includes 
unemployment. If the question is how does underclass differ from long-term welfare 
recipients, single-parent families, teen pregnancy, and so forth, the answer is that it 
includes all those. And now street crime has been added to the definition. This label, thus, 
hardly advances our knowledge or understanding about the lives of poor residents in inner 
cities. (Billingsley 1989:23) 


Yet even some structuralists hesitate to embrace this thesis. So Duncan et al 
after rejecting the causal link between welfare and dependency, conclude, 
* Although long-term recipients constitute a minority of all individuals who 
ever receive welfare, the fact that more than two million families are per- 
sistently dependent on welfare raises questions of whether welfare itself 
promotes divorce or out-of-wedlock births, discourages marriage, erodes 
work effort, or instills counterproductive attitudes and values that encourage 
reliance on welfare" (Duncan et al 1988:470). 


Ethnographers 


Ethnography, "the poor sociologist's substitute for the novel," is often viewed 
as "subjective, idiosyncratic and eccentric" (Bell & Newby 1972:1). Mere 
journalism, it has been called. Yet Michael Katz has written that "its minimal 
contribution is the destruction of stereotypes; its signal achievement, in the 
work of its most able practitioners, is a portrait of the world from the vantage 
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point of its subjects." There is nothing new about the ethnographic approach 
tò the study of poverty. Indeed, the last time poverty was popular, debates 
were greatly aided by what became classic studies—Eliot Liebow’s Tally’s 
Corner (1967), Joseph Howell's Hard Living on Clay Street (1973) and Carol 
Stack's All Our Kin (1974). 

In particular, Eliot Liebow's work on street corner men in Washington 
reveals the way ethnography haunts the other models. Liebow found: 


The streetcorner man does not appear as a carrier of an independent cultural tradition. His 
behavior appears not so much as a way of realizing the distinctive goals and values of his 
own subculture, or of conforming to its models, but rather as his way of trying to achieve 
many of the goals and values of the larger society, of failing to do this, and of concealing 
his failure from others and from himself as best he can. (Liebow 1967:222) 


But in the late eighties, arguments for the newness of the class demanded a 
new research (Hagedorn 1988. Sullivan 1989, Monroe & Goldman 1988, 
Rose 1987). Elijah Anderson's work among black males in Chicago and 
Philadelphia was one of the first to satisfy that demand. Anderson attaches 
rich descriptive detail to the meaning of values and identity of a "marginal" 
segment of the urban poor. “The emerging values seem primarily to involve a 
Social process, open to negotiation, of settling for what they have to work 
with in demanding deference and social regard," observes Anderson (An- 
derson 1978:211). These experiences are not captured in conventional poverty 
charts, graphs, and tables (Katz 1989:171). Anderson, for example, adopts a 
part culture, part structure position when he writes, "With the dearth of 
well-paying jobs, public assistance is one of the few reliable sources of money 
in the community, and, for many, drugs is another . . . In these circum- 
stances, babies and sex are used by some for income; women receive money 
from welfare for having babies, and men sometimes act as prostitutes to pry 
the money from the women. The community seems to feed on itself" (An- 
derson 1989:77). Yet his studies are not suspended from the general dis- 
course. Like Wilson, Anderson argues strongly for a historic ghetto where 
"mentors within the black middle class" living among the poor "supervised 
young people, offered adult role models, and sustained strong local in- 
stitutions” (quoted in Katz 1989:179, see also Anderson 1978:210). 

According to Jencks, this view is not supported in John Johnstone's study 
of a Chicago neighborhood. Johnstone found "that poor teenagers with 
affluent neighbors reported having committed more serious crimes than poor 
teenagers who lived in poor neighborhoods" (quoted in Jencks 1988:30). 
Reed is also uncomfortable with what he calls these notions of a “largely 
mythical past," commenting, "The fact is that that safe, organic community 
was a more limited, complex and problematic entity than nostalgia would 
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allow, and its glue was not so much nuclear, “intact” families as the im- 
peratives of racial segregation" (Reed 1988:168). 

On the topic of absent fathers, Mercer Sullivan found "cultural" (defined as 
"collective responses of people with distinctive group histories to difficult and 
changing structural positions in society") differences not in sexual activity but 
in the use of contraceptives and in response to pregnancy among the ethnic 
poor. In the typical black underclass neighborhood, he found a high level of 
*disapproval of abortion" but a low level of immediate marriage or coresi- 
dence. Even in the absence of marriage, however, fathers took some 
responsibility for their children. “Most provided some measure of care and 
support, to the extent that they were able. They contributed money, some 
from employment, usually part-time and/or low-wage, other from criminal 
activities" (Sullivan 1989:53—54). 

“Primordial cultures" out of the mainstream, however, are less often noted 
in ethnographic work. Indeed, Williams & Kornblum found that the teenage 
mothers of their sample spend hours and hours watching television, leading to 
the suspicion that they get many more of their ideas about life from the media 
than they do from each other (Williams & Kornblum 1985, see also Anderson 
1989:62). The authors found, for example, a strong awareness of weapons by 
young drug dealers not from their community connections but from Clint 
Eastwood movies (Marks 1989:62). 

It is not only "culture" that is challenged in these vignettes. Jay McLeod, in 
Ain't No Making It, forges yet another middle ground between culture and 
structure in explaining upward mobility; he argues that "aspirations provide a 
conceptual link between structure and agency because although they are 
rooted firmly in individual proclivity (agency), they also are acutely sensitive 
to perceived societal constraints (structure)" (quoted in Katz 1989:172). 


Conclusions 


Understanding the underclass in the midst of competing world views and 
ideological verbiage is a ‘difficult task. “Culture” to the determinists seems 
more like some highly contagious social disease than a shared system of 
knowledge that constitutes a way of life passed from generation to generation. 
Like Marx’s discussions of the French peasantry which he declined to call a 
class, the underclass as presented seems to have a “mode of production which 
isolates them from one another instead of bringing them into mutual in- 
tercourse" (Caute 1967:74). The group appears amorphous and represents not 
so much a class as a hodgepodge of societal failures: the homeless, the 
mentally ill, physically disabled, and a variety of the poor; elderly and young, 
single parented and intact families. Some, such as Auletta's "passive victims 
of governmental support," are relatively néw. But most have been studied and 
analyzed with great frequency over the past one hundred years. Many sound 
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suspiciously like Marx's "dangerous class, the social scum, that passively 
rotting mass thrown off by the lower layers of society" (Caute 1967:181). 

Remedies for the structuralists have always been their weakest point. 
Solutions frequently involve mundane "programs of job training, child care 
and work-oriented education," programs for which, at best, there is but slight 
public taste and even less evidence of effectiveness (Bassi & Ashenfelter 
1986, 151). Indeed; structuralists become faint-hearted in the face of their 
own analyses. If the economy is crumbling from uneven growth, automation, 
and labor market segmentation, what good will it do to develop job training 
and work-oriented education programs for those at the bottom? (Bonacich 
1989:43-44). This point becomes particularly timely as we witness the daily 
flight of young workers from "full employment" socialist economies. 
Moreover, arguments from the structuralists for an underclass embedded 
within the poor tend to downplay the urgency of the problem. Slipping back 
into discussions of "the poor ye always have with you" is doomed to be 
matched with and linked to "benign neglect," community self-help, or "back 
of the hand" remedies. , 

And the ethnographers, admittedly advantaged by direct contact with the 
underclass, nonetheless are often forced to concede the obvious biases of their 
small and selective samples (“a fraction of the underclass”) and conclude with 
“a call for more research” (Hagedorn 1988:111, 165). 

Through it all a lingering suspicion remains that only bits and pieces are 
known, in Billingsley words, “about the lives of poor residents of the inner 
cities” (Billingsley 1989:23). Future research will undoubtedly continue the 
trend of attempting to discover a common ground, palatable to both sides. 
Wilson’s notion of cultural pathologies with structural origins has so far been 
met with heavy skepticism. It may, however, signal a first step in this 
process. The ethnographer’s portraits of a vulnerable poor who do not start 
out on drugs, in gangs, or in jail, but have little opportunity to offset the 
concommitant effects of low education and unemployment, are persuasive. 
“When jobs are not available, living up to rules of conduct based on the values 
of ‘decency’ becomes difficult, and those rules based on the residual values 
become a more viable alternative for maintaining self-esteem” argues An- 
derson (Anderson 1978:210). 

But the real advance in the underclass discussion will come about when 
theorizing expands beyond the limited arena of the culture versus structure 
debate. The first step is a realization that the poor are neither homogeneous 
nor monolithic. The individual pathologies of a small subset, no matter how 
unsavory, are not relevant to solving the problems of the majority. Second, 
the plight of the poor is imbedded in a global economy and attached to 
economic processes that are international. The insistence on limiting the 
debate to local conditions or more specifically the populations of Chicago and 
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New York is shortsighted. Third, planning for the future involves more 
significant societal changes than retooling or reunionization. A decision must 
be made to choose between guaranteed incomes and national health care or 
modern almshouses. As Michael Katz has concluded, "The fundamental 
questions are not about the details of policy or the sources of revenue; they 
are, rather, about the basis of community, the conditions of citizenship, and 
the achievement of human dignity. They are, that is, about our definition of 
America and just how much we are willing to do to realize it" (Katz 


1989:239). 
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Abstract 


Few aspects of international social change have generated as much scholar- 
ship as patterns of urbanization in the Third World. In this review of in- 
terdisciplinary research, we first trace the trends and dimensions of urbaniza- 
tion in developing countries and then discuss major theories guiding global 
urban studies. Second, we review and critique recent cross-national in- 
vestigations of the determinants of urbanization and its dimensions, conclud- 
ing that severe underspecification, the dearth of comparative statistics on 
critical dimensions, and the ambiguity of proxy variables hinder research in 
this area. Finally, we discuss issues that warrant additional investigation in 
the near future. 


URBAN TRENDS IN THE THIRD WORLD 


The Third World is experiencing an urban explosion. In 1950, only 285 
million people, or 1646 of its total population, dwelt in urban areas. By 1985, 
the number of urban. inhabitants in developing countries had expanded to 1.2 
billion (30% of the total). The United Nations projects that during the 1990s 
the urban population in developing regions will rise to approximately two 
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billion (more than twice that found in developed regions), and that shortly 
after the turn of the century the majority of all Third World people will be 
urban residents (United Nations 1989). Simple arithmetic reveals that the net 
addition of urban population in developing countries between 1980 and 2000 
will be greater than the total urban population in developed countries in the 
year 2000. During the 1990s alone, Third World cities will grow by an 
aggregate of over 160,000 persons per day. 

This is not to suggest that Third World urbanization rates, or growth in the 
proportion of the population in urban areas, are particularly anomalous by 
historical standards. The rise of urbanization in the aggregated Third World is 
broadly comparable to the same process in the West over a century ago. What 
is unusual is the absolute increments of Third World urban populations, which 
clearly exceed any historical precedent. By the best accounts, urban popula- 
tion increase in developing countries is roughly double that experienced in the 
West at comparable levels of development (Preston 1979, Williamson 1988). 

Several differences in the situations of the contemporary Third World and 
the First World at the inception of industrialization help to account for 
differentials in urban trajectories. First, Europe and North America enjoyed 
superior natural endowments and higher absolute levels of development, 
attributes which moderated the problems associated with rapid urbanization. 
More importantly, natural population increase was much lower in the West 
during the Industrial Revolution than is the case in much of the Third World 
today. As Williamson (1988) notes, the growth rate of the English population 
between 1776 and 1871 was approximately half that of the contemporary 
Third World, a fact that goes far in explaining the explosion of Third World 
urban populations. Finally, the global context differentiates First World and 
Third World experiences. Western industrial pioneers did not face more 
advanced economic competitors. Credit and consumer markets were more 
accessible, and relatively inefficient production practices which could not 
compete effectively today were quite profitable in that era. Moreover, the 
discovery and colonization of the New World allowed Europe to export its 
surplus population. Bairoch (1988) estimates that Europe lost between 42 and 
46 million people to international migration prior to 1913, populations that 
probably would have migrated to European cities in the absence of this 
colonial safety valve (Massey 1988). 

Although the discussion above suggests that the growth of urbanization and 
urban population is uniform throughout the developing world, such is not the 
case. Between 1950 and 1990, the most rapidly urbanizing nations were to be 
found in northern and tropical Africa and southern and southeast Asia, a fact 
partially explained by the small urban base of such nations at the beginning of 
the period. On the other hand, most of the countries in Central and South 
America, which exhibited higher levels of urbanization in 1950, have ex- 
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perienced declining annual rates of urbanization since 1960. For these nations 
the management of rapidly expanding urban populations rather than the 
swiftness of the rural-to-urban transition has become an overriding concern 
(Roberts 1989). 

The most anomalous region in terms of Third World urbanization is east 
Asia, where rapid economic development, rapid rural-to-urban migration, and 
declining population growth have led to very rapid urbanization in such 
nations as Korea (Renaud 1981). The Chinese case is different still, where 
very stringent controls on migration have lent China growth rates more typical 
of nations at higher levels of urbanization, at least in the 1960s and 1970s 
when such controls were strictly enforced (Parish 1987, Roberts 1989). 
Nonetheless, most of the Third World continues to experience rapid urban 
growth. 

The evolution of Third World megacities punctuates this astounding expan- 
sion of urban population. Mexico City, for instance, contained 3.1 million 
inhabitants in 1950 but is expected to exceed 24 million during this decade. 
Likewise, Sao Paulo, a city of just 2.8 million in 1950, is projected to reach 
approximately 24 million by the year 2000 (United Nations 1989). As a 
comparison: it took New York City (the world's largest metropolis in 1950) 
nearly a century and a half to expand by eight million residents. Mexico City 
and Sao Paulo will match this increment in less than 15 years. 

Although smaller in magnitude, the dramatic growth of Mexico City and 
Sao Paulo is echoed in developing nations around the globe, resulting in a 
swift multiplication of cities containing a million or more residents, from 31 
such cities in 1950 to 150 today, with a projected 279 in the year 2000. By the 
end of this decade, nearly one out of two urbanites in the Third World will 
reside in a city of one million or more persons (UNCHS 1988). 

Although the absolute sizes and growth rates of cities and urban pop- 
ulations do lend particular relevance to Third World urban studies, de- 
mographics alone does not fuel scholarship in this area. Foremost, urbaniza- 
tion is both a manifestation of development processes and an important 
explanatory variable in most major theories of social and economic change in 
the Third World. The reasons underlying this emphasis on urbanization as a 
Social and economic indicator are its relationships to internal economic 
growth, human welfare, the diffusion of science, population distribution and 
demographic patterns, and processes of global development. 


DIMENSIONS OF URBANIZATION 


Numerous dimensions of urbanization have attracted the attention of social 
scientists in the past. A few of these dimensions, such as the proportion of a 
population living in urban areas, are easily grasped and usually generate little 
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controversy, while others, such as urban primacy, involve several complex 
notions that defy simple definitions or operationalizations. Despite differing 
degrees of complexity, research on Third World urbanization has at one time 
or another touched upon a wide variety of urban dimensions, and we 
summarize these in turn. 


The Level of Urbanization 


Definitions of urban place involve many related concepts such as a complex 
division of labor, density, size, and characteristics of the built environment. 
Most analysts, however, rely on population size within a bounded area as the 
basic criterion separating urban from rural places. Also, because national 
definitions of urban areas vary, many now use the United Nations definition 
of urban localities (20,000+) and cities (100,000+) to standardize com- 
parisons of urban development across nations (UNECA 1968). 

The level of urbanization (or percent urban) has been associated with 
numerous positive societal outcomes such as technological innovation, eco- 
nomic progress, and higher standards of living (Teune 1988, Bradshaw & 
Fraser 1989). It has also been linked to social and environmental problems 
and to perceived mismatches between population distributions and economic 
development (Dogan & Kasarda 1988a,b, Timberlake 1985a). Apropos the 
latter, much debate continues to focus on the role of Third World urbanization 
as it influences and is influenced by economic base changes, employment 
opportunities, city-size distributions, hinterland development, and social and 
spatial inequality. 


Overurbanization 


The concept of "overurbanization" (Davis & Golden 1954) has experienced 
cycles of acceptance and rejection. Beginning with Hoselitz (1954), the 
notion that urbanization in developing nations is too great given the sectoral 
(industrial) mix of their economies persists despite criticisms concerning the 
arbitrary nature of the concept and its inherent ethnocentrism (Sovani 1964, 
Breese 1966, Kamerschen 1969, Hawley 1981, Smith 1987). This persistence 
can be explained by the simple observation that, contrary to the development 
experiences of many Western countries, Third World urbanization does not 
seem dependent on industrial or manufacturing employment (Bairoch 1975, 
1988, Todaro 1981, Moir 1976). 

Although alternative conceptualizations include urbanization relative to 
labor absorption (Todaro 1981), the national capital endowment (Richardson 
19872), the ability to provide social services to the urban population (Gugler 
1982), and physical disamenities such as pollution, crime, and congestion 
(Teune 1988), more typically overurbanization has been viewed as the 
tendency in developing countries for urbanization to outstrip either industrial 
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or overall economic development. According to those most critical of this 
phenomenon, the structural condition of overurbanization and its growth lead 
to saturated urban labor markets, truncated opportunity structures in rural 
areas, overburdened public services, distorted sectoral development in nation- 
al economies, the isolation of large segments of the urban and rural population 
from the fruits of economic development, and retarded economic growth due 
to the high costs of urban development (Gugler 1982, Bairoch 1988, Gilbert 
& Gugler 1982, Lipton 1977). 

Overurbanization could be operationalized in a number of ways, but most 
researchers use either urban-to-industrial ratios (Bairoch 1975, Preston 1979, 
Kentor 1981, London 1987) or a residualized measure obtained by regressing 
the level of urbanization on the level of development (Timberlake & Kentor 
1983, Bradshaw 1985, 1987). Regardless of its operational definition, the 
concept remains arbitrary. It assumes that some optimal balance exists be- 
tween the level of urbanization and economic or sectoral development, a 
relationship between industrialization and urbanization that is somewhat 
ambiguous even in the historical experiences of western Europe (Sovani 
1964). Moreover, industrial employment as a percentage of the labor force 
has remained fairly stable in much of the Third World in this century despite 
rapid urbanization, suggesting that Western development may, in fact, 
represent the true anomaly (UNDESA 1980, Bairoch 1988). 


Urban Primacy 


Urban primacy has been defined as a condition where a single city dominates 
other cities in the intranational urban system or, more precisely, where one or 
a very few cities deviate substantially from log-normalcy in the national urban 
system in terms of population size. Since its early formulation, the concept 
has been modified to include the notion of two-city primacy (e.g. Brazil's Sao 
Paulo and Rio de Janiero), regional multiple-city primacy (e.g. India's urban 
system), and multicentric urban systems (where cities are more nearly equal 
in population size than would be predicted by the rank-size rule). Still, the 
urban systems of the developing world tend toward single-city primacy, and 
the list of such primate systems is long (e.g. Uruguay's Montevideo, Nige- 
ria's Lagos, Thailand's Bangkok, and Tanzania's Dar es Salaam). 

Urban primacy is burdened by both conceptual and practical difficulties, 
however. Foremost, primacy is thought to denote a lack of economic, politi- 
cal and social integration in any given system of cities, and for this reason it is 
critical that the boundaries of that city system be known (Walters 1985). 
While most past research relies on national political boundaries to define an 
urban system, the use of such boundaries is logical only if the internal urban 
system exhibits high closure. Cities that serve as links between their own 
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urban systems and others might be expected to be primate, and therefore the 
conceptual strength of the primacy notion is weakened (Vapnarsky 1969). 

Another difficulty with the concept is primacy relative to urban size and 
total population size. Are the problems associated with urban primacy the 
same in an urban system dominated by a city of 200,000 as they are in one 
dominated by a megacity? As Richardson (1987b) notes, urban planners and 
policy-makers often forget that the problems associated with primacy differ 
by population size, and in the rush to forge more evenly balanced urban 
systems fundamental needs such as scale economies may be overlooked. 

In terms of practical difficulties, the availability of city-specific data for 
many Third World nations also hinders research in this area. An early attempt 
by Davis (1969) using the four-city primacy index was restricted to only 35 
developing nations. El-Shakhs’ (1972) study also employed a four-city index 
(with aggregated urban residual populations as the fifth city), analyzing some 
75 national urban systems, many of them in developed countries. In a more 
recent longitudinal study of urban primacy, London (1987) constructed a 
two-city index for approximately 40 developing nations. While this served to 
maximize the number of cases in the Third World, the trade-off in terms of the 
epistemic correlation between the primacy concept and the operational defini- 

‘tion left much to be desired, as London admitted. 

Despite these difficulties, it is important to keep urban primacy in perspec- 
tive. For instance, it has been estimated that approximately 80% of pop- 
ulations in the developing world live outside of cities of 100,000 persons or 
more (UNDESA 1985). Also, some evidence suggests that for a select group 
of middle-income countries, deconcentration or Richardson's (1980) 
“polarization reversal" is occurring (Vining 1985, Rondinelli 1982, El- 
Shakhs 1982, Portes 1989). Nonetheless, urban primacy remains one of the 
principal topics of comparative urban studies for reasons similar to those 
associated with overurbanization. The lack of “balance” or “proper” city-size 
distributions is thought to represent both a symptom of underdevelopment and 
perhaps a structural constraint to future development through such mech- 
anisms as inadequate investment in rural areas, sectoral distortions of the 
economy, retarded fertility and mortality decline due to socioeconomic dis- 
articulation, and poor political integration (Lipton 1977, Gugler 1982, Arm- 
strong & McGee 1985). 

A related topic is the relative cost/benefit trade-off of megacities. To 
illustrate the point: in 1950 Shanghai was the only city in the Third World 
with 10 million or more residents. By the end of this decade, there will be 18 
such megacities with four of these each housing between 15 and 25 million 
people (United Nations 1989). As we shall see, a standard assumption of the 
modernization approach asserts that concentration and megacity growth are a 
function of development, and that very large cities help a nation to con- 
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centrate its meager financial resources efficiently until such time as resources 
and growth allow deconcentration and regional spread (Alonso 1980, Mera 
1973, El-Shakhs 1972, Wheaton & Shusido 1981). Alternative approaches 
note that agglomeration economies are important, but only up to 2 or 3 million 
urban inhabitants (Nakamura 1985, Henderson 1986, 1988, Carlino 1982, 
Montgomery 1988). Still other scholars, predominately from a world-systems 
or dependency approach, view megacities and urban primacy as important 
components of dependent capitalism which allow surplus value to be transmit- 
ted to the core (Armstrong & McGee 1985, Timberlake 1987). 

Although this issue cannot be resolved in the short space allotted here, it is 
interesting to note the following. First, although it can be argued that many 
megacities are fraught with environmental problems and inefficiencies, the 
fact remains that they tend to contribute disproportionately to their national 
economies. For instance, Shanghai produces 11.496 of China's industrial 
output and supplies 18% of the national government's revenue, although it 
receives far less proportionate investment in return (Murphy 1988). Similarly, 
Lagos provides 5096 of Nigeria's manufacturing value-added (McNulty & 
Adalemo 1988), while Mexico City generates one third of Mexico's gross 
national product (Schteingart 1988). Likewise, the Bangkok metropolitan 
area represents only 10% of Thailand's population, but contributes 86% of the 
nation's gross national product in producer services and 74% in manufactur- 
ing, while Sao Paulo, with 1096 of Brazil's population, contributes over 4096 
of industrial value-added and 25% of net national product (Rondinelli & 
Kasarda 1990). 

The statistics above illustrate that megacities are cradles of modern in- 
dustrial production. Moreover, as Mabogunje (1983) notes, megacities offer a 
far wider range of high-level goods and services, a potentially faster rate of 
technological innovation, and a focal point for accommodation between the 
diverse ethnic and religious groups of many Third World countries. They also 
provide huge concentrated markets for agricultural products and ideal en- 
vironments for informal sector activities. Finally, while no one doubts the 
severe disamenities bred by megacity growth such as crime, pollution, and 
congestion, the majority of Third World megacity migrants feel that their 
lives have improved as a result of their move (Dogan & Kasarda 19882). 

Concomitant with the concern over primacy and megacities is the emphasis 
placed on the integration of national space-economies. Specifically, many 
scholars have noted the lack of important secondary and tertiary cities in the 
Third World and the great disparities in regional and urban/rural socioeco- 
nomic development (Gilbert 1982, Lipton 1977, Rondinelli 1980, Bairoch 
1988). Given that agriculture and rural-based industry played such a large role 
in the industrial development of the West (Bairoch 1988), concern for region- 
al and urban-rural investment differentials seems warranted in international 
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studies. Rondinelli (1983, 1985) points out the critical functions that can be 
filled by viable secondary and tertiary cities, among which are the de- 
centralization of public services, the provision of a variety of household and 
consumer goods, the establishment of regional marketing centers and their 
linkages to large metropolises, the creation of new consumer markets, the 
creation of small- and medium-sized labor-intensive enterprises, and the 
diffusion of modernity. As with most policy prescriptions that sound poten- 
tially effective, however, deconcentration policies in the past have had a 
mixed-to-poor history of performance in bolstering regional and secondary- 
city development (Richardson 1987b, Shrestha 1987). 


Migration and Natural Increase 


The proximate determinants of urban growth can be grouped into three 
categories: (a) the natural increase of urban populations; (b) migration, both 
intranational (rural-to-urban and urban-to-urban) and international; and (c) 
boundary redefinition through annexation of surrounding areas. While bound- 
ary change may account for as much as 20% of Third World urban growth in 
recent years (Preston & Green 1985), most research has concentrated on the 
relative contributions of natural increase and migration. Of these two, rural- 
to-urban migration has been the principal focus of scholarship for two rea- 
sons. First, excessive rural outmigration fits nicely with several dominant 
theories of development and is thought to be responsive to rural adversity and 
the allure of modern wage structures in urban areas (Todaro 1969, Lipton 
1977, Gilbert & Gugler 1982). For this reason, many believe that migration is 
more easily modified by national policies. Second, the common belief that 
migrants to cities present greater challenges in terms of labor absorption and 
political stability naturally has led to the intense focus on this phenomenon 
(Bienin 1984, Herrick & Hudson 1981). 

In fact, no single definitive statement can be made about the proportionate 
demographic contributions to city-building. The United Nations UNDESA 
(1985) asserts that approximately 60% of urban growth can be attributed to 
natural increase and the remainder to migration, a finding more or less 
supported by empirical research (Rogers 1982, Preston 1979). On the other 
hand, this assertion is based on averages, and some evidence points to the 
relativity of migration's contribution to urban growth (Oliveira & Garcia 
1984, Newland 1980, Martine 1975). One complication is the fact that the 
relative contributions of natural increase and migration are conditioned by 
the initial level of urbanization. At low levels of urbanization, migration 
is the prime engine driving city growth, as is now the case in much of Africa. 
As the urban base grows, however, more and more migrants are required 
to match urban natural increase, and the pattern reverses itself (Keyfitz 
1980, Rogers 1984). Therefore, although natural increase is responsible for 
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most urban growth in the Third World, the age-selectivity of migrants 
(through their contribution to natural increase) as well as regional variations in 
urbanization and development complicate this picture. 

Structural forces that lead to rural-to-urban migration also vary by region. 
In particular, although some emphasis has been placed on the appeal of cities 
to potential migrants, considerably more attention has been devoted to the 
interaction between agrarian structures, rural adversity, and rural-to-urban 
migration. In Latin America, rural adversity arises primarily from a high 
degree of land inequality, a by-product emerging principally from colonial 
heritage but also due to the commercialization of agriculture and land specula- 
tion as a hedge against inflation (Furtado 1970, Oberai 1983, Wood, et al, 
1988, Berry & Cline 1979). In many regions of Asia, population densities on 
arable Jand lead to diminishing returns and excessive subdivision of peasant 
holdings. African agriculture, while not burdened with shortages of land, is 
hampered by the low carrying capacity of soils, biological pests that spread 
disease, traditional production methods, and problems in articulating labor 
supplies with seasonal requirements (Todaro 1989). Regardless of regional 
variations, these problems can lead to landlessness, underinvestment, and 
outmigration from rural areas. 

Many other push factors that influence cityward migration can be enumer- 
ated. The import-substitution strategies of many Third World nations siphon 
off agrarian surplus to fund industrial development in urban areas through 
such mechanisms as pricing differentials that disadvantage small agricultural- 
ists and the overvaluation of national currencies, which lowers the costs of 
capital-intensive imports but makes primary commodity exports less competi- 
tive in the world market (Lipton 1977, Oberai 1983). Moreover, agricultural- 
ists in the small-farm sector suffer from monopsonistic market configurations, 
a severe lack of marketing information, the lack of storage and transport 
facilities, the inability to obtain credit through formal channels, and the lack 
of vocational/technical training in rural areas (Rondinelli & Ruddle 1978). 
Therefore, even when land is available in sufficient quantity and quality, the 
combined influence of rapid population growth and poor market infrastructure 
and supporting institutions conspires to push many from the land toward 
cities. 

Although basic causes seem straightforward, migration is of a highly 
variable nature. Although most demographers and urbanists have focused on 
rural-to-urban migration, recent evidence points to a much more complex set 
of migration streams. Urban-to-urban migration, rural-to-rural migration, 
and, at early stages of development, circular migration are among the types of 
migration found to predominate in many Third World countries (Thomas & 
Mulvihill 1980, Deere & Leon de Leal 1982, Corner 1981, Benoit 1982, 
Russell & Jacobsen 1988, Findley 1987, Goldstein 1987). 
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Once again, the type and rate of migration depends on region and level of 
development, among other things, a fact borne out by such examples as Sao 
Paulo, where one third of growth can be attributed to urban-to-urban migra- 
tion (Faria 1988), and many nations of west Africa, where circular migration 
has been very common (Gugler & Flanagan 1978, Colvin et al 1981). Further 
complexities arise in considering the composition of migration streams. At 
least two distinct flows appear in research, one flow of the reasonably well-off 
in search of urban opportunities in employment and education, and another of 
the poor and landless in search of subsistence. Moreover, idiosyncracies in 
migration flows occur due to such factors as the availability of secondary 
employment in nearby cities and towns, caste or ethnic connections that 
predispose certain groups to migrate to specific cities, the configuration of 
roads and the accessibility of transportation, and the institutionalization of 
rural-to-urban migration as a part of the life course (Connell et al 1976). 
Nonetheless, few doubt that most rural-to-urban migration is used by house- 
holds as a survival strategy, although the poorest segment of rural populations 
may find the cost of such strategies prohibitive. 

Given the importance of international migration in the development of 
Europe, the influence of migration across national borders on urbanization 
and agrarian structures should not be overlooked. Developed nations such as 
the United States, France, Germany, and Switzerland have served as destina- 
tions for millions of Third World citizens from Mexico, Turkey, and many 
nations of north Africa, and more elite streams of immigrants from around the 
globe. Furthermore, a few Third World countries such as the Ivory Coast, 
Kuwait, Saudi Arabia, Libya, Ghana, Zaire and Gabon play host 
to citizens of nearby nations. Although these international migration streams 
are small in comparison to historical European migrations, they do relieve 
some pressure on many Third World cities and provide important sources 
of capital in the form of remittances (Martin & Houstoun 1979, Cornelius 
1976). 

Massey (1988) presents a theory of international migration which posits 
that forces of development and modernization combine to destroy traditional 
rural life in Third World countries, resulting in a constant impetus to emigrate 
from rural areas. Variables that determine international migration include the 
degree of economic integration between origin and destination, the existence 
of transportation routes and information flows, and the degree of recruitment 
of labor on the part of the more developed partner. Economic conditions 
determine whether the potential migrant selects an internal or external destina- 
tion. Such flows of international migrants are likely to become perpetual due 
to the flow of information and chain migration within migrant networks, the 
changes brought about in land holdings in sending communities due to 
emigrant capital flows, and renewed international migration as a response to 
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feelings of relative deprivation on the part of families that have not shared in 
capital earned abroad (Stark & Taylor 1989). 

Although international labor migration can increase inequality in sending 
communities, it also can have the dual benefit of providing capital for 
consumption and investment as well as relieving some pressure on over- 
burdened Third World cities (Cornelius 1976). However, given the small size 
of such migration streams relative to overall population growth and the 
barriers to international migration erected by the United States and other 
developed regions, it is unlikely that international migration will serve the 
same function that it did for Europe a century ago (Martin & Houstoun 1979). 


Job Creation and the Urban Informal Sector 


No other topics in the study of Third World urbanization have generated as 
much recent research as job creation, labor absorption, and the urban informal 
sector. Considering that the labor force in developing countries is expected to 
increase from 1.8 billion currently to 3.1 billion by the year 2025, job creation 
has become an overriding concern (Rondinelli & Kasarda, 1990). Because of 
this, it is hoped that informality can provide an alternative path to job creation 
and social mobility (Tokman 1989). 

The major difficulty in studying informal employment and job creation 
rests in the definition of the urban informal sector. Early attempts by the 
International Labor Organization (ILO) to define informality attributed such 
characteristics as ease of entry, small scale, internal finance and resources, 
family-ownership, labor-intensive production, the use of local technologies, 
and unregulated competitive markets to informal sector activities (ILO 1972). 
Other researchers have defined the sector with stress on one or more of these 
characteristics, such as the type of technology used and competitive markets 
(Souza & Tokman 1976), the dichotomy between "protected" and "un- 
protected” wage structures in the urban economy (Squire 1981), the level of 
organization and its relationship to the state (Sethuraman 1976), or the degree 
to which informal enterprises obey laws related to wages, permits, taxation, 
and zoning (de Soto 1989). So far as characteristics can be gleaned from this 
welter of concepts, it appears that informality can be defined along four 
continuums: (a) size of firm, (b) sources of financial and material resources, 
(c) the degree of linkages between the regulated and unregulated sectors of the 
economy, and (d) the extent to which firms comply with government regula- 
tions. 

As general postulates, the smaller the enterprise, the more internalized its 
sources of working capital, the more tenuous its links to more formal enter- 
prises, and the less it complies with legal requirements, the more "informal" it 
becomes. Kannappan (1983, 1985) extensively reviews research on urban 
labor markets in the Third World, demonstrating that very little evidence 
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exists for dividing urban employment structures into a formal/informal dual- 
ity. Formal and informal wage increases correspond closely, wage structures 
of the two sectors do not display sharp disparities, migrants are not at a 
disadvantage in terms of urban employment, and ascribed status, region of 
origin, and other personal characteristics are used by employers as hiring 
criteria in both modern and "traditional" sectors. These considerations lead to 
the rejection of dualistic labor market theories and suggest that the urban 
economies of Third World nations are very heterogeneous and adaptive. 
Furthermore, recent research leads away from notions about benign or ex- 
ploitative relationships between sectors of an urban economy and encourages 
the adoption of a model of urban labor markets characterized by complex 
interdependencies and dynamism. i 

Although operational problems persist, Sethuraman (1981) estimates that 
informal employment accounts for one third to one half of urban employment 
throughout most of the Third World. In addition, it appears that informal 
employment is increasing faster than formal employment in many developing 
countries (de Soto 1989, Richardson 1984). However, some scholars note that 
formal and informal sectors expand and contract together, and that recent 
austerity measures in certain heavily indebted Third World countries have ' 
dimmed the prospect of a robust, expanding informal sector, in essence 
reducing informal sector employment to subsistence activity (Roberts 1989, 
Portes 1989). If this is the case, it provides even more evidence that Third 
World urban labor markets are not sharply dualistic, that sectors are mutually 
interdependent, and that previous conceptualizations of the informal sector as 
"residual" or “make-work” employment were less than accurate. 

Indeed, recent research has significantly modified or reversed many of our 
previous conceptions of what the informal sector is and how it works. First, 
far from being a marginal and entirely separate sector of production, the 
informal sector is extremely heterogeneous. In fact, many researchers have 
dichotomized this sector into the intermediate subcontracting segment which 
is highly linked to the formal sector and a more marginal informal sector less 
integrated into the urban economy (Connolly, 1985, Portes & Benton 1984, 
Toure 1985, Safa 1986, House 1984). Although arguments about the ex- 
ploitative relationship between formality and informality continue (Portes & 
Sassen-Koob 1987, Portes & Walton 1981, Bromley 1982, Bromley & Gerry 
1979), others have found that formal-informal linkages appear to be symbiotic 
and mutually beneficial for the most part (House 1984, Teilhet-Waldorf & 
Waldorf 1983, de Soto 1989). 

A second misconception corrected by recent research is that the informal 
sector consists mostly of recent migrants, and that poverty characterizes 
workers in this sector. While it is true that many informal sector workers, and 
particularly employees of informal sector firms, are migrants, many migrants 
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also gain initial employment in the formal sector (Toure 1985, Oliveira & 
Garcia 1984). Conversely, many indigenous city dwellers are found in the 
urban informal sector (House 1984, Teilhet-Waldorf & Waldorf 1983, 
Mazumdar 1981). Just as it is not valid to characterize the vast majority of 
rural-to-urban migrants or informal sector workers as unskilled and un- 
educated (Linn 1983, Herrick & Hudson 1981, Preston 1979), it is likewise 
ill-advised to assume that low-capital, labor-intensive industries are staffed 
almost exclusively by recent migrants. 

Third, although it is likely that the 10 to 20% of the workforce employed by 
the formal sector in most Third World cities enjoys a degree of social 
mobility, the notion that informal sector activities lead to social immobility is 
simply not warranted. The informal sector is not employment of last resort 
(Portes & Benton:1984, Richardson 1984, Findley 1990), and evidence exists 
that informal wages, particularly among entrepreneurs, can be at least as 
remunerative as formal sector wage employment. Also, there are often returns 
to human capital and opportunities for the accumulation of wealth in the 
informal sectors of many Third World cities often not found in formal sector 
employment (Connolly 1985, Mohan & Hartline 1984, House 1984). Thus, 

' there is evidence of formal sector workers voluntarily choosing informal 
sector occupations (Teilhet-Waldorf & Waldorf 1983, Peattie 1982a, Arias & 
Roberts 1985, Roberts 1976). 

In general, then, informal workers are better educated, earn more money, 
and enjoy greater’social mobility than was thought possible before the current 
round of case studies. These generalizations should not be overstated, howev- 
er. Most Third World cities experience unemployment above 1046 of the labor 
force (Todaro 1989), and without doubt the most “informal” of workers such 
as street vendors, bootblacks, or certain classes of employees do suffer 
various degrees of absolute poverty. Nonetheless, the picture of the urban 
informal sector which emerges from recent literature is considerably more 
positive than was previously the case. In terms of labor absorption, subsidy- 
free human capital formation, low-cost provision of goods and services to the 
urban poor, and the retention of scarce capital, the urban informal sector is far 
more productive than formal sector employment (de Soto 1989, Richardson 
1984). In fact, some believe that recognition of such activities by the state and 
the resultant access to formal credit could reduce the dependency of the 
informal sector on formal activities and lend it the leading economic role in 
most developing countries (Chaudhuri 1989, de Soto 1989). 


Housing and Internal Spatial Form 


Through a process of organized and unorganized invasions of urban real estate 
and illegal subdivision and sale of land, many cities in the developing world 
are dotted with non-standard, poor-quality housing units interspersed with 
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sanctioned land uses. Because these settlements rest on land typically un- 
serviced by public utilities and infrastructure, urban planners and city officials 
have been concerned for public health, political unrest, and poor urban 
development (see Peattie & Aldrete-Haas 1981 for a review of informal 
housing). 

The importance of informal housing to Third World cities follows from its 
important contribution to shelter. It is estimated that between one fifth and 
one half of all urban residents in the Third World dwell in informal shelters, 
and this figure increases to over 70% for some cities such as Casa Blanca, 
Ibadan, and Addis Ababa (UNCHS 1982, McTaggart 1988, Ghosh 1984). 
Moreover, projections indicate that the developing world will need to create 
an additional half-billion housing units to shelter its burgeoning populations 
between now and the year 2000, figures that suggest no alternative to the 
growth of informal settlements (Palmer & Patton 1988). 

While it is clear that rapid natural increase and large streams of inmigrants 
encourage land speculation and inflate urban land prices in the Third World, 
the notion that informal settlements are housing of last resort is inaccurate. 
Often, informal homeowners view the movement to informal settlements as a 
move up and out of squalid rental quarters or alternative living arrangements 
(Alexander 1988, de Soto 1989). In addition, while some evidence suggests 
that informal housing is dominated by migrants (Hollnsteiner 1972, Muwonge 
1980), others have pointed out that such social patterns as chain migration, 
boarding with relatives, the use of rental housing upon arrival in the city, and 
live-in servitude create a pattern of residential mobility that starts with 
employment and culminates in private, informal homeownership (Costello 
1987, Karpat 1976, Alexander 1988, Linn 1983). Moreover, much research 
points to the viability of informal settlements as communities, possessing 
complex interdependencies composed of economic and social cooperation 
(Beinin 1984, McTaggart 1988). 

The picture emerging from case studies does not suggest that informal 
settlements can be characterized as defeated, socially disorganized neighbor- 
hoods, but rather as vital, if oftentimes poor, communities. Although social 
problems such as limited public health measures, poor access to transportation 
and social services, and tenuous land tenure persist in most informal settle- 
ments in the Third World, the magnitude of housing demand and the in- 
adequacy of public housing starts suggest that only remedial schemes such as 
site-and-service measures are likely to meet with success. Yet, even these 
measures may cause harm through the destruction of informal construction 
activities and the tendency for serviced land to inflate in price, thereby 
excluding large segments of the population from the urban land market 
(Peattie 1982b, Hardoy & Satterthwaite 1981). 

Urban morphology is another important consideration. While the internal 
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spatial structure of Third World cities is highly variable (Lowder 1987), urban 
form tends to converge with North American and European patterns as 
industrialization accelerates. However, this convergence is imperfect due to 
social stratification systems based on ascribed status (i.e. ethnicity and reli- 
gion), the myriad regulations of Third World land markets that hamper market 
exchanges, government regulations such as elite housing subsidies and rent 
control, and the existence of preindustrial physical stock such as urban 
cantonments and religious centers. Another complication to this spatial pat- 
tern is the existence of informal settlements. Although most commonly found 
on the urban periphery (where public land is more available and less fiercely 
protected), informal housing tends to fill up the interstitial niches between 
formal land developments (Lowder 1987). For this reason, it is not atypical to 
find squatter settlements located in areas of high disamenity (i.e. abandoned 
industrial sites, hills) nestled between middle-class housing and commercial 
structures. While this tends to degrade the visual image of many Third World 
cities, it also serves to bring diverse social groups into daily contact and to 
provide better access to employment for the pedestrian poor (Portes 1989). 


THEORIES OF URBANIZATION 


Space constraints preclude a thorough discussion of every theory of urban 
development in the Third World. For this reason, we will present con- 
stellations of theoretical thought categorized by the institutional frameworks 
they stress. Although the presentation of theories as constellations focuses on 
similarities rather than differences, each theoretical family is fairly cohesive 
and lacks irreconcilable contradiction. 


Modernization/Ecology 


Three concepts are central to this set of theories. First, the present state of 
urbanization and development in any given society cannot be divorced from 
its initial state at the onset of modernization. Recent research reminds us that 
societies begin with a differential set of institutional and infrastructural pat- 
terns derived from past technological-economic capacities. For instance, 
traditional societies that developed agriculture possessed the surplus to create 
a variety of social patterns such as urban systems, monied economies, govern- 
ment bureaucracy, and literacy, all traits that may ease the necessary dis- 
orientation bred by the intrusion of modernity (Lenski & Nolan 1984, Nolan 
& Lenski 1985). The second assumption of these theories is that technology is 
fundamentally more important than a society’s social organization. Indeed, 
although social organization does create innovation and technology, the 
application of technology is the prime engine of social change. Thus, scholars 
working in this tradition tend to view industrialism rather than capitalism as 
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having a unique impact on Third World societies. Finally, the modernization/ 
ecological approach asserts that cultural diffusion, despite breeding inevitable 
social disequilibria (uneven development), will eventually force a rough 
convergence of First World and Third World development patterns. Although 
the theory makes provision for the resistance of traditional mindsets, the 
constraints of previous social organization (e.g. colonial patterns), the diver- 
sity and possible antagonisms inherent in ethnic, racial, or religious divisions, 
and inappropriate or corrupt politics, technology is viewed as possessing a 
systems logic that eventually overrides older forms of social organization and 
replaces them with its own institutional matrix (Kerr et al 1960, Hawley, 
1981). 

Modernization/ecological theory explains Third World urbanization with 
reference to the modern/traditional economic dichotomy and demographic 
transition theory. According to this view, city-building in the Third World can 
be attributed to cultural lag in equilibrating fertility and mortality differentials 
and massive rural-to-urban migration stemming from rural-push and urban- 
pull factors (Berliner 1977, Spengler & Myers 1977, Hawley 1981). Two 
principles are involved here. First, although mortality-reducing technology is 
easily diffused throughout most Third World nations, this is not the case for 
the institutional/industrial web of modernity in general. For a variety of 
reasons associated with agglomeration economies and the inertia of in- 
frastructural development, Third World nations at the onset of modernization 
concentrate investment and therefore population into one or a few large cities 
(Alonso 1980). Constrained opportunities in rural areas and the allure of 
modern cities lead to rural-to-urban migration. Moreover, the age-selectivity 
of migrants and an overall high rate of fertility swell urban areas beyond their 
capacities to provide infrastructure, housing, social services, and employ- 
ment. This approach treats these conditions as a temporary mismatch between 
demographics and economics, however. The belief is that as modern in- 
stitutional forms, technologies, and mindsets diffuse out from urban centers, a 
process of urban/hinterland, intraregional and interregional integration 
occurs, eventually smoothing the course of social change and ameliorating the 
worst inequalities. 


Dependency/World-Systems 


The unifying concept shared by all dependency and world-systems theorists is 
that either through intentional coercion or through the inherent logic of 
capitalism certain areas and their populations have been “underdeveloped” 
(Frank 1967, Galtung 1971, Wallerstein 1974, 1980, Timberlake 1985b). 
Major theoretical statements concerning dependency and dependent develop- 
ment have been made by numerous scholars, making a single concise review 
of thought in this area extremely difficult (Baran 1957, Frank 1967, Evans 
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1979, Wallerstein 1974, 1980, Cardoso & Faletto 1979, Timberlake 1985, 
1987, Bornschier & Chase-Dunn 1985). Although the shift to critical analysis 
has been reviewed elsewhere (Hermassi 1978, Chirot & Hall 1982), at a more 
basic level this constellation of theories rests‘on three assumptions. First, 
these critical theorists assume that a uniquely capitalist development pattern 
exists, in essence asserting that capitalism is a unique form of social organiza- 
tion. Second, dependency/world-systems theorists assume that capitalism 
requires a certain social structure, which is characterized by unequal ex- 
change, uneven development, individual social inequality, core-periphery 
_ hierarchies, and dominance structures. Where these features are not built into 
the social structure, these theorists assume that capitalistic development will 
generate them. Finally, with the possible exception of the physical environ- 
ment, world-systems and dependency theorists model social organization, 
technology, and population dynamics as endogenous facts in development 
and urbanization (Timberlake 1987). Typical ecological variables are only 
proximate determinants of urbanization, which in turn are dependent on the 
type, rate, and depth of interaction with the capitalist world system. 

Thus, dependency and world-systems theorists view the structures and 
processes of the capitalist world system as the prime causes of social change 
in the Third World. Overurbanization is seen as a function of rapid natural 
increase, rural-to-urban migration, and agrarian adversity, but these determi- 
nants are created by the heritage of colonialism, capital-intensive foreign 
manufacturing, enclosure of agricultural lands for export production, and the 
economic stagnation bred by the oligopolistic practices of multinational cor- 
porations (Timberlake 1987, London 1987, Bornschier 1981). In like manner, 
the existence of informal employment is a direct function of political repres- 
sion, overpopulation, and struggling economies, but these intermediate pat- 
terns are generated by the capitalist need to subsidize formal sector workers 
and to extract greater shares of surplus value (Portes 1985). From this 
perspective, the nations and regions of the globe are interconnected through 
an urban network, the “world cities” of the developed world using the primate 
capitals of the Third World as accumulators and transmitters of wealth. In 
turn, these primate capitals skew development of hinterlands in order to 
increase capitalist profitability and to reinforce the highly unequal social 
structure required by capitalist systems (Cohen 1981, Douglass 1988, Arm- 
strong & McGee 1985). 


Distributive Coalitions/Urban Bias 


Political theories such as Olson’s (1982) distributive coalitions theory and 
Lipton’s (1977) urban bias theory shift the emphasis from economic elites to 
political elites. Both theories make three basic assumptions. First, political 
institutions by virtue of their monopoly on coercion are inherently more 
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powerful than other institutions in shaping development. While it is true that 
social (e.g. religious/ethnic) and economic elites wield great power, their 
abilities to gain hegemony over a nation's future depend on capturing and 
holding the political machinery. Second, these theorists posit that coalitions 
organized around shared self-interest combine either to exert pressure on the 
state or to dominate it, and the express purpose of these activities is to 
generate laws that divert social surplus to special interests. Third, these 
theorists assume that such coalition-building is not the sole property of any 
particular economic regime. Indeed, theorists employing this approach would 
expect to see distributive coalitions and spatial and social bias in public policy 
wherever government involvement in social and economic affairs has been 
consistent and extensive. 

Although the precepts discussed above pertain specifically to distributive 
coalitions theory, urban bias theory can be considered a special case of these 
general principles. According to Lipton (1977), the elites and most powerful 
interest groups of Third World nations rely primarily on urban resource bases, 
and for this reason most macro- and microeconomic policy initiatives favor 
the overdevelopment of urban areas and the underdevelopment of rural areas. 
Specifically, import-substitution strategies have been pursued by many Third 
World nations in an attempt to develop indigenous capital-intensive industry. 
To finance the costly importation of capital-goods necessary for such produc- 
tion, such inputs are artificially cheapened by overvaluing currency. The 
effects on primary commodity exports have been noted above. In addition, 
domestic prices for food and other rural inputs necessary to urban living have 
been artificially lowered by government edict and/or the monopsonistic mar- 
ket practices of urban-based landlords, in essence subsidizing the urban sector 
at the expense of the agrarian sector. Although several other such mechanisms 
are commonly practiced in the Third World, the point here is that many social 
ills are the result of economic exploitation of agriculture, exploitation made 
possible by government intervention. Also, Lipton (1984) notes that severe 
discrimination against rural areas exists in both market and non-market 
economies, dispensing with the notion that urban bias is somehow a unique 
product of capitalist economics. 

Although support for the effects of urban bias has come from a variety of 
sources (Lipton 1973, 1977, 1984, London & Smith 1988, Bradshaw 1987, 
Ellis 1984, Nolan & White 1984, London & Anderson 1986, Todaro 1981), 
the real contribution of these theories is the emphasis on the state as an 
independent actor in development and urbanization. It is fair to say the 
socioeconomic theories have neglected the role of the state, whereas with a 
few important exceptions (e.g. Rubinson 1977, Delacroix & Ragin 1981, 
Moon & Dixon 1985) dependency and world-systems theory has viewed the 
state as an extension of global capitalist processes (Skocpol 1977). 
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Critique 


Our brief overview of major theoretical groups does some injustice to all of 
them, but it does illustrate the basic parameters of each group. While synthe- 
sis is possible and has been attempted in limited ways (Meyer 1986, Nemeth 
& Smith 1985a, Kennedy & Smith 1989, Bradshaw 1987, London 1987), 
such marriages of theoretical families may not be particularly happy unions. 
First, modernization/ecological theory.does not stress class relations or 
capitalism per se, but rather the inevitable tensions created by the shifts in 
social organization encouraged by industrialism. The complete metamorpho- 
sis of any economic regime is difficult for the populations involved, as is 
evidenced by the social problems that characterized the Industrial Revolution; 
the social problems of the Third World are likely to be worse for two reasons: 
the existence of developed countries, and changes in the global situation. This 
should not be misinterpreted. The exploitation of the periphery by the core 
may not be the primary cause of such phenomena as overurbanization, 
primacy or sectoral distortion. Rather, it is the contact between developed and 
developing nations that breeds social problems in the short-term. The lack of 
new territory to relocate surplus population, the desires of Third World 
populations to enjoy the fruits of modernity, and of course the search for new 
markets by core capitalists and new allies by the socialist world have acceler- 
ated both the process of social change and its attendant social problems. 
However, the idea that this accelerated development was specifically de- 
signed or built by the logic of a particular economic form or that chronic 
overurbanization, primacy, poor economic growth or dependent relations 
must become perpetual is rejected by most scholars using the modernization/ 
ecological approach. 

Second, most modernization/ecological theorists have trouble understand- 
ing the phrase "uneven development." While critical scholars view lopsided 
economic progress and "social disarticulation" as the sources of many Third 
World ills and as direct by-products of capitalist development, modernization/ 
ecological theorists find the counterfactual, or "even" development, a rather 
naive concept. Cultural diffusion is always uneven both socially and spatially, 
such theorists think, and somehow forcing a uniform spread of development 
activities might actually prove counterproductive and therefore harmful to 
human welfare in the long-term. 

Third, modernization/ecological theorists view social inequality as an in- 
herent property of social organization. In essence, social organization is in: 
part inequality. Ecological theorists assert that while the characteristics of 
elite groups may vary according to their connection to key functions, the 
existence of elites or their tendency to be self-serving is invariable (Lenski 
1966). Therefore, most modernization/ecological theorists question the latent 
assumptions of dependency/world-systems theory that national autarchy 
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might prove more beneficial to the Third World or that some alternative 
political-economic regime might be inherently superior because of its differ- 
ing pattern of social stratification. For instance, the existence of "informal" 
sectors in non-capitalist nations (Dallago 1987, Sit 1983) and the similarity of 
urban problems and patterns between the East and West (Friedrichs 1988) 
suggest that the determinants of urbanization may be more fundamental than 
those posited by dependency/world-systems theorists. 

On the other hand, some have noted that the socialist and capitalist urban 
systems do differ in important ways, and more research needs to be done on 
this topic (Bradshaw & Fraser 1989, Kennedy & Smith 1989, French & 
Hamilton 1979, Stark 1989). Moreover, as modernization/ecological theorists 
seek to become less naive themselves, a dependency/world-systems theory 
that has made developmental theory sensitive to international/'economic con- 
texts should benefit all theories. Certainly no recent work can afford to ignore 
the contributions of these recent critical approaches to Third World urbaniza- 
tion and development, and because most major theories of development are 
materialistic and macrosocial in character, there is hope that theoretical 
schisms can be overcome, deemphasizing rhetoric and facilitating the return 
to normal science (Hawley 1984). 


THE STATE OF EMPIRICAL ANALYSIS 


This portion of our review is meant to serve three purposes. First, we 
highlight quantitative analysis and multivariate models in particular. Second, 
we relate findings to existing theories. Third, our assessment is meant as a 
guide to future research. Because we find problems of underspecification so 
severe, we hope this assessment will sharpen future models and thus lend 
them additional rigor. 


Development Indicators 


For a variety of reasons, we should expect development to be positively 
related to urbanization. As a society generates more and more surplus, 
economic activities shift to higher-order production techniques, techniques 
that are partially dependent on the spatial concentration of related economic 
activities, labor, and consumer markets. Moreover, trade between societies 
has always been a factor in economic development, and linkages between 
societies are facilitated by concentrated nodes of economic and social activity 
(Tolley 1987). Indeed, one unequivocal fact about urbanization is its relation- 
ship to the level of development. As a general principle, the higher the level 
of development as measured by the percent of the labor force in industry or 
gross national product per capita, the higher the level of urbanization (Preston 
1979, Bairoch 1988, Pernia 1988, Moir 1977, London 1987). Moreover, it 
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appears that rapid economic growth contributes to urbanization (Becker & 
Morrison 1988, Preston 1979). Indeed, one of the common measures of 
overurbanization, the residual produced by regressing the level of urbaniza- 
tion on gross national product per capita, would be invalid if the relationship 
between urbanization and the level of development were not linear and fairly 
tight. 

The relationship between development and relative urbanization is not so 
consistent. In the case of urban primacy, El-Shakhs (1972), Wheaton & 
Shusido (1981), DeCola (1984), and Petrakos & Brada (1989) found that the 
relationship between primacy and development is curvilinear (rising primacy 
from low to intermediate levels of development, and declining thereafter). 
One study restricted to developing countries found a positive relationship 
between development and primacy (London 1987), whereas other studies 
have found either a negative relationship between the two or no relationship at 
all (Rosen & Resnick 1980, Mehta 1969, Sheppard 1982, Berry 1971, Mutlu 
1989; Frey et al 1986). Differences in samples, operationalizations, time- 
frame, and models preclude confident generalizations at this point, but it does 
appear that empirical and theoretical evidence suggests a maturation effect 
among urban systems as posited by modernization/ecological theory. 

On the other hand, Smith (1985) notes that such cross-sectional evidence 
using both developed and less-developed nations does not provide sufficient 
evidence of a maturation process. Specifically, although economic growth has 
occurred in much of the Third World, the developmental gap between the 
First and Third World has been fairly stable, suggesting that the current 
distribution of primacy across developmental strata may reflect a fixed rather 
than dynamic set of social phenomena. 

Less research has been conducted on the relationship between over- 
urbanization and the level of development. Of the handful of empirical studies 
on overurbanization and its growth, only one found a negative relationship 
between development and overurbanization (Moir 1977), while others have 
found no clear statistically significant link between the two (Kentor 1981, 
Bradshaw 1987, London 1987, Crenshaw 1990). It appears that this phenom- 
enon may not be a function of development, at least in the case of Third 
World countries. 

The shift to service-sector economies in developed countries has been 
widely noted, so it is not surprising that this relationship has also been 
examined in a Third World settine. Moreover. tertiary emplovment has heen 














used in the past as a rough proxy for the informal sector. Although a few 
studies have dealt specifically with the tertiary sector and its determinants, 
some have not reported findings on the effects of development (Kentor 1981, 
Bradshaw 1985, 1987). Three studies found no statistically significant 
relationship between the two variables (Timberlake & Kentor 1983, Fiala 
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1983, Crenshaw 1990). However, two recent studies assert that tertiary 
employment and its growth are positively related to the level of development 
(Semyonov & Lewin-Epstein 1986) as well as to other modernization in- 
dicators such as the percent urban and life expectancy (Thompson & Stollar 
1983). 


Demographic Indicators 


As noted by Pernia (1988) and Preston (1979), natural increase is positively 
related to urban population growth. In addition, high levels of urbanization 
dictate somewhat lower rates of growth in urbanization due simply to the fact 
that equilibrium is reached within the population, or to a ceiling effect 
(Brueckner 1990, Crenshaw 1990). The other demographic factor influencing 
urbanization and urban population growth is migration. Rogers (1982) points 
out that while natural increase explains most urban population growth, high 
levels of rural-to-urban migration tend to accelerate urbanization. In fact, 
unless urban fertility rates exceed rural rates, it can hardly be otherwise. 

The interactions of population and economic contexts may constitute im- 
portant determinants of urban growth and urbanization as well. Firebaugh 
(1979) and London (1986) note that the density of the rural population is 
related to cityward migration, as is the system of land tenure and certain types 
of agriculture (e.g. plantation agriculture). Constraints on land ownership and 
rural overpopulation tend to push surplus populations toward cities (Bairoch 
1988), and the concentration of non-farm activities in Third World cities 
suggests rural underdevelopment as an additional impetus to rural-to-urban 
migration (Firebaugh 1984). 

Demographic variables associated with urban primacy are (a) population 
growth rate (Berry & Kasarda 1977, DeCola 1984); (b) ethnic composition 
(Mutlu 1989), and (c) population size (Mutlu 1989, Sheppard 1982, Rosen & 
Resnick 1980, Mehta 1969, London & Anderson 1986, DeCola 1984) and a 
related variable, territory size. It seems clear that rapid population increase 
exacerbates agrarian adversity and reflects natural increase within cities, 
promoting urban primacy. Moreover, Mutlu (1989) posits that ethnic 
heterogeneity leads to political pressures that encourage more balanced in- 
vestment patterns by national governments, resulting in greater development 
of secondary cities and regional capitals. In addition, although the effects of 
agricultural density are still in doubt, part of the reason that agricultural 
adversity might display an inconsistent pattern across nations is simple: 
Agricultural lands have differential carrying capacities related to soils, cli- 
mates, technologies, and crops, and because agricultural densities are relative 
to carrying capacities they may not be of much value in cross-national 
research. Moreover, it is likely that small countries with small populations 
exhibit higher primacy than larger countries. Small size does not lend a nation 
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the resources or population to build an economically efficient system of cities; 
agglomeration economies may dictate concentration. 

The effects of demographic variables on overurbanization and tertiary 
employment are relatively unknown. While the level and growth of over- 
urbanization are thought to inflate the tertiary sector (Kentor 1981, Bradshaw 
1985, 1987), Crenshaw (1990) has explicitly modeled the growth of over- 
urbanization, tertiary employment, and sectoral hypertrophy as functions of 
population size and growth in the belief that very large and rapidly growing 
populations inundate the cities and urban labor markets of Third World 
countries. It appears that population size is positively related to the growth of 
overurbanization and tertiary employment, while rapid population growth 
tends to encourage sectoral hypertrophy. 


Sectoral-Imbalance Indicators 


Sectoral imbalance, or a high degree of inequality between economic sectors, 
is thought to promote high levels of rural-to-urban migration through the 
creation of wage disparities, which in turn accelerates urbanization and 
relative urbanization (Todaro 1989). Although we can be fairly certain that 
differentials in economic opportunities between rural and urban areas do play 
a major role in rural-to-urban migration, the contribution of sectoral inequal- 
ity to urbanization is far from certain. Some research suggests that wage 
differentials between manufacturing and agriculture are prime determinants of 
urbanization (Kelly & Williamson 1984a,b, Brueckner 1990), while others 
note that rural adversity tends to depress rural-to-urban migration (Becker & 
Morrison 1988, Preston 1979). Indeed, it is clear that new migrants to Third 
World cities are not the poorest segment of the rural population, and while 
higher urban wages may attract these migrants, urban amenities may also 
promote migration (see London & Anderson 1986 for the effects of urban 
amenities). In addition, some have suggested that the wage structures induced 
by modern capital-intensive manufacturing are not the principal inducement 
to migration, but rather government employment, informal sector employ- 
ment, education, and simple rural push factors (Becker & Morrison 1988, 
Henderson 1986, Connell et al 1976). 

Other studies point out that rural adversity is positively related to the 
growth of urbanization and primacy (Petrakos & Brada 1989, Firebaugh 
1979, London 1986, 1987). In addition, urban bias as measured by sectoral 
productivity differentials is related to the growth of primacy (London 1987) 
and overurbanization (London 1987, Bradshaw 1985, 1987). Similarly, the 
Gini coefficient of sectoral inequality appears to be related to the growth of 
urbanization, primacy, and overurbanization, but as with agricultural density, 
these last two effects are strong only when African nations are excluded from 
the sample (London 1987). Finally, Mutlu (1989) has found that income 
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inequality is positively related to primacy, given that such inequality may be 
more severe in rural areas and may provide the stimulus for rural-to-urban 
migration. 

The link between sectoral and social inequality and the growth of the 
tertiary sector is less clear. While some assert that the growth of tertiary 
employment exacerbates inequality (Evans & Timberlake 1980), others point 
out the more logical position that, given the limited labor absorption of 
modern industrial sectors in urban areas, tertiary employment provides better 
opportunities than may be available in rural areas, therefore helping to 
decrease income inequality (Fiala 1983). Moreover, Bradshaw (1985, 1987) 
found that sectoral productivity differentials (urban bias) are positively related 
to hypertrophy of the tertiary sector, but only indirectly through their impact 
on overurbanization. 


Theory-Specific Indicators 


In terms of modernization/ecological indicators, it is fairly certain that trans- 
portation networks, the location of cities (ports, etc), the location of govern- 
ment services, and the physical characteristics of the urban hinterland affect 
the level and rate of growth of urbanization and urban primacy (Preston 1979, 
Bairoch 1988, Dogan 1988, DeCola 1984, Rosen & Resnick 1980, London 
1986, Henderson 1982, 1986). In addition, there is some indication that 
higher human capital formation in rural areas leads to higher levels of 
rural-to-urban migration, although this is conditioned by opportunity struc- 
tures in rural areas (Bairoch 1988, Henderson 1986, Connell et al 1976). 

A small but growing literature also links dependency/world-systems effects 
to urban development. It appears that export dependency may be linked to 
urban primacy (Frey et al 1986), as is foreign capital penetration and world- 
systems position (London 1987). Another set of studies concludes that depen- 
dence on foreign investment encourages overurbanization (Kentor 1981, 
Timberlake & Kentor 1983, Bradshaw 1985, 1987, London 1987) and urban 
bias (London & Smith 1988).. Against this evidence is that of Kelly & 
Williamson (1984a,b), Rosen & Resnick (1980), and Crenshaw (1990) who 
found no clear link between urbanization and urban primacy and various 
measures of foreign investment and trade. Also, Petrakos & Brada (1989) 
found that primacy and foreign capital penetration share a complex relation- 
ship conditioned by the level of development. Finally, a large number of 
studies link foreign investment and world-systems position to hypertrophy of 
the tertiary sector and social and sectoral inequality (Evans & Timberlake 
1980, Timberlake & Kentor 1983, Bradshaw 1985, Timberlake & Lunday 
1985, Bornschier & Chase-Dunn 1985, Snyder & Kick 1979, Nemeth & 
Smith 1985b, Bornschier et al 1978, Nolan 1983). 

Against this are studies by Fiala (1983), who found no direct link between 
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foreign investment and tertiary employment, Bradshaw (1987), who found 
that foreign investment impacts sectoral hypertrophy only indirectly through 
overurbanization, and Semyonov & Lewin-Epstein (1986), who found that 
foreign capital penetration encourages production services, not personal and 
social services. Differences between studies may spring from differing op- 
erationalizations of foreign investment (i.e. stocks versus flows, differing 
timeframes), modeling alternative hypotheses, and sample composition. We 
assume that arguments over proper operationalization and specification, caus- 
al inference, and omitted variables will continue. . 

Although limited support has been found for urban bias theory in conjunc- 
tion with generalized tests of dependency/world-systems theory (Bradshaw 
1987, London 1987, London & Smith 1988), support for the notion that 
government policies and expenditures have a unique influence on urbaniza- 
tion comes from diverse sources. It appears that administrative centralization, 
the centralization of government social services, and general government 
expenditures are positively related to primate development (Petrakos & Brada 
1989, Mutlu 1989, Crenshaw 1990, London 1986, London & Anderson 
1986). Moreover, some policies pursued by Third World governments, such 
as minimum wage legislation, import-substitution, and bowing to pressure 
from trade unions may encourage intense urban concentration (Henderson 
1982, Nemeth & Smith 1985a, McNulty & Adalemo 1988). Finally, it 
appears that ethnic homogeneity and political instability in rural areas are 
positively related to urban concentration (Mutlu 1989, Petrakos & Brada 
1989). Although de Soto's (1989) case study of Lima, Peru stands out as 
evidence of the effects of government on an urban economy, we know of no 
data source that would allow such detailed analyses of legal structures on a 
cross-national scale. 


Critique of Quantitative Research 


The salient problem we find in the research cited above is the frequent 
severity of underspecification. Although studies by Bradshaw (1987), London 
(1986, 1987), London & Smith (1988), Kelly & Williamson (1984b), and 
Crenshaw (1990) stand out as attempts to synthesize the dominant theoretical 
perspectives in single tests of validity, all have their failings. For instance, 
most of the recent studies finding dependency/world-systems effects on 
urbanization and tertiary employment are severely underspecified. None 
make allowances for the influence of demographic processes on urbanization, 
and while London (1988) attempts to illustrate the link between foreign 
investment and lagging fertility decline, some evidence suggests that popula- 
tion growth and overurbanization attract foreign capital, calling into question 
the causal order of these relationships (Crenshaw 1991). Moreover, the size of 
national economies and populations can influence the rate of urbanization and 
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urban concentration; the failure of dependency researchers to incorporate size 
dimensions into their equations determining forms of urbanization may se- 
verely distort research findings, as is suggested by the fact that most measures 
of foreign capital penetration are by definition correlated with size di- 
mensions. In other words, failure to model size dimensions may lead to 
spurious results under certain specifications (Crenshaw 1990). 

On the other hand, most research conducted under a modernization/ 
ecological approach has failed to take international linkages and processes 
into account. Whether or not one views the process of modernity as primarily 
an industrial or capitalist set of phenomena, clearly international relations and 
global flows of capital, technology and population do affect the process of 
urbanization. As Meyer (1986) points out, although hierarchy and dominance 
are ecological principles, these have not been emphasized in empirical re- 
search by ecologists or modernization theorists. 

With this in mind, the value of cross-national research should be apparent. 
Individual case studies and regional comparisons suggest new variables and 
theories that can be tested at the cross-national level, and tests of the universal 
applicability of these determinants of urbanization are vital both to social 
science and policy formation in the Third World. However, the attempt to 
quantify variables suggested by general theory and case studies leads to our 
last criticism of recent empirical investigations: the use of proxy variables. 

Given the futility of urging scholars to search for better indicators (in most 
cases the variables used are the best indicators), we wish to caution scholars 
on the validity of their interpretations of proxy effects. For instance, urban 
bias generally is measured as the ratio of non-agricultural to agricultural 
productivity, a proxy that roughly represents the urban/rural productivity 
differential. Interpreting this as urban bias is problematic, however. First, 
there is no guarantee that all modern production is located in urban areas for 
any given Third World country (e.g. extractive industries are highly pro- 
ductive and are generally located outside of cities). Second, this proxy 
variable does not guarantee that the productivity differential is related to an 
urban bias in public policies concerning infrastructural development, 
macroeconomic policy, or the placement of elite services. Finally, the vari- 
able at best measures the relative efficiency of urban areas over rural areas; 
given equal levels of capitalization and human capital, agglomeration 
economies might be expected to lend cities a productivity advantage over 
agricultural production. 

All this suggests that proxy variables should be used with caution. 
Although we do not hold out much hope for quantum improvements in 
comparative, cross-national statistics, we do hope that national and in- 
ternational funding agencies will begin to standardize the parameters of the 
case studies they fund. Most current theories could benefit from detailed 
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city-specific data on such variables as foreign investment, sectoral distribu- 
tions of the labor force, patterns of social mobility, political/legal systems and 
management initiatives, public ownership of production facilities, and trade 
flows between intra- and international systems of cities. Detailed case studies 
could begin to provide this information for diverse regions around the globe, 
but this potential could only be realized under the coordination and 
standardization of the larger international organizations. 


FUTURE RESEARCH NEEDS 


In addition to our call for improved specification of statistical models and 
standardization of research projects, several topics warrant special attention in 
the years to come. These topics include: (a) the efficiency of cities relative to 
various dimensions; (b) the detailed analysis of the influence of government 
on urbanization and urban economies; (c) the movement away from nation- 
states and toward individual cities and sub-regions as units of analysis; and (d) 
detailed, longitudinal assessments of social mobility, human welfare, eco- 
nomic interdependencies and economic growth within urban informal sectors. 

Because the notion of overurbanization retains theoretical appeal despite its 
conceptual and empirical problems, it is important to expand our knowledge 
concerning the costs and consequences of certain types of urbanization. While 
most economic and sociological research stresses the relationship between 
hyperurbanization and economic growth, efficiency relative to human welfare 
and ecological constraints may be more fruitful avenues for future work. 
Despite the possibility that overurbanization and primacy entail economic 
sacrifices due to the capital costs associated with city-building, it is possible 
that rapid urbanization holds long-term advantages in terms of improved 
quality of life, accelerated human capital formation, the creation of the 
entrepreneurial spirit, and integration into the world’s social and economic 
order. In addition, it is likely that the relative efficiencies of cities are not 
simply functions of city size but also the dimensions of total popülation, 
economic size, and the characteristics of national territories. For instance, tbe 
exigencies created by small size and the aridity of hinterlands may present 
some countries with unpleasant choices between hyperurbanization and abso- 
lute stagnation. Research in this area should attend to ecological constraints, 
in other words. 

Second, our understanding of the influence of political and legal systems on 
the type, rate, and efficiency of urbanization is rather sketchy. Do com- 
monalities exist between nations in terms of development policies, and how 
are these policies related to such phenomena as rural underdevelopment, 
urban concentration, informal employment and industrial mix? Has the struc- 
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ture of foreign investment changed, and what has been the role of government 
in the sectoral mix, rate, and control of foreign investment? What are the 
goals of public ownership of industry, and how does public ownership affect 
urbanization, economic growth, social mobility, and social welfare? 
Although each of these questions has been touched on by recent research, 
unfortunate gaps in our knowledge remain. 

Third, a principal contribution of dependency/world-systems theory is the 
notion that a world system of cities exists and plays a crucial role in the 
development (or underdevelopment) of the Third World. For this reason, it 
would seem appropriate to concentrate on individual cities and their networks 
rather than on nation-states as units of analysis. While work in this area has 
been attempted by Chase-Dunn (1985), Meyer (1986), and others, data 
availability has restricted our understanding of these networks and their social 
and economic implications. Data constraints notwithstanding, interested 
scholars should begin to assemble city-specific information on the structure of 
local economies and labor forces, trade composition and flow, industrial mix 
in terms of technical intensity, carrying capacity in terms of the labor force, 
foreign investment and its forward and backward linkages, municipal political 
structures, and so forth. This is tough work even when possible, but potential 
rewards include in-depth hypothesis testing, newer and better theories of 
urban development, and research results that are policy-relevant and can be 
applied to real-world problems. 

Finally, it is not novel to call for more detailed study of the urban informal 
sectors of the Third World. Fine case studies have considered questions about 
social mobility, factor endowment, intersectoral linkages and labor force 
composition, but room exists for research that is at once breader and more 
detailed. What similarities exist between the distribution of informal sector 
activities of various Third World cities? What is the contribution of informal 
activities to urban and national economies? How does informal employment 
influence rural-to-urban migration, the morphology of cities, or the social 
organization of migrant families and communities. All of these questions have 
been asked before, but our ability to draw generalizations is hampered by the 
idiosyncrasies of case studies and region-specific analysis. We do perceive a 
growing consensus on definitions and the broad characteristics of urban 
informal sectors, however, and we hope that investigators pursuing case 
studies will keep in mind the need for comparisons across cities and nations. 
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Abstract 


Sport is a very prominent social institution in almost every society because it 
combines the characteristics found in any institution with a unique appeal only 
duplicated by, perhaps, religion. The functional, conflict, and cultural studies 
perspectives are reviewed, with additional discussion on how sport relates to 
the processes of socialization and social change. The latter focusses on the 
evolution of sport from a playful, participation-oriented activity to one that 
resembles a corporate form guided by the principles of commercialism and 
entertainment. The role of sport in international relations and national de- 
velopment dramatizes the political meaning of sport to many societies. While 
sport may be integrative at the higher political levels, it has not been so at the 
interpersonal levels of gender and race. The inequality that characterizes 
society's relations of gender and race is found in sport as well. The sociology 
of sport will be able to shed more light on all of these issues when theory 
informs more of the research in this subfield. 


INTRODUCTION 


Sport is an arena of patterned behaviors, social structures, and in- 
terinstitutional relationships that holds unique opportunities to study and 
understand the complexities of social life. Sport is an activity that commands 
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a degree of primary or secondary involvement unsurpassed by other in- 
stitutionalized settings. Sport offers an opportunity for research on "highly 
crystallized forms of social structure not found in other systems or situations" 
(Luschen 1990:59). That is, field research in sport provides, for example, 
structured conflict and competitiveness in controlled settings rarely found in 
other aspects of social life. Group dynamics, goal attainment by social 
organizations, subcultures, behavioral processes, social bonding, structured 
inequality, socialization, and organizational networks are just a few sociologi- 
cal topics that can be studied in sport settings. This view presupposes that the 
structure or forms of behavior and interaction found in sport settings are 
similar to those found in other societal settings. In other words, sport like 
other institutions is a microcosm of society. 

At the same time that sport is a product of social reality, it is also unique. 
No other institution, except perhaps religion, commands the mystique, the 
nostalgia, the romantic ideational cultural fixation that sport does. No other 
activity so paradoxically combines the serious with the frivolous, playfulness 
with intensity, and the ideological with the structural. 

The purposes of this review are twofold: first, to describe the current 
understanding of sport and society through the work of sport sociologists in 
selected areas, and, second, to assess the adequacy of the research and 
theorizing in these fields. The paper is divided into five parts. The first part 
presents the theoretical perspectives guiding research in sport sociology. The 
next two sections examine sport and two social processes: socialization and 
social change. The fourth section looks at an institution—the polity—and its 
relationship to sport. The final section demonstrates how sport reinforces 
racial and gender inequality. Except for the section on the polity, the discus- 
sion focuses on North America, especially the United States. 


THEORETICAL PERSPECTIVES 


At the macro level, the focus of this paper, three theoretical perspectives— 
structural functionalism, conflict theory, and cultural studies—are prominent 
in the works of sport sociologists. The sociology of sport emerged in the 
1960s and 1970s. The earliest contributions to the field were efforts to 
delineate and justify this new subfield as a scholarly and important field of 
inquiry, and they tended to be informed by structural functionalism. The 
structural-functional paradigm focuses on social organizations, how they 
work, and how they are maintained. The functions (i.e. consequences) of 
patterned behaviors for the organizations in question, are emphasized (Frey 
1986). Guided by the tenets of structural functionalism, these and subsequent 
works focused on socialization of youth through sport, sport as a vehicle for 
assimilation, sport as a social system, the relationship of sport to other 
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institutions, and the integrating functions of sport for participants, observers, 
and social organizations. 

These works appeared at a time in American history when society was 
undergoing rapid changes, popular American beliefs were being challenged, 
and authority questioned. Many sport sociologists reacted against structural 
functionalism with its bias for accepting and rationalizing what is; they 
adopted, rather, a theoretical perspective— conflict theory—which argues that 
sport reflects and reinforces the hegemony of societal arrangements, thereby 
perpetuating class and power differentials (Haerle 1974, Brohm 1978, 
Gruneau 1983, Hargreaves 1986, Sage 1990). Those guided by conflict 
theory also focused on social problems in sport such as sexism, racism, 
classism, oppression, organizational conflict, and deviance. 

Recently, a number of sport sociologists have embraced a cultural studies 
approach to sport-related phenomena. From this perspective both functional 
and conflict perspectives are challenged because they are deterministic, that 
is, they tend to ignore or dismiss human agency in social change. Also, both 
theoretical perspectives are considered faulty because they omit the signifi- 
cance of cultural imperatives, where sport is seen as a cultural expression. In 
this view, sport is a socially constructed arena where developmental and 
emergent features are of central significance. Canadian and European scholars 
have taken the lead in writing from this perspective (Gruneau 1983, Har- 
greaves 1986), followed recently by American scholars (MacAloon 1987, 
Harris 1989, Birrell 1989). 

Cultural studies theorists reject the natural science model as the best means 
to know a phenomenon. Rather, a phenomenological approach is preferred. A 
leading proponent asserts: 


The replication of so-called scientific studies of sport has done little to enbance either our 
knowledge or understanding of the nature and meaning of sporting practices. By separating 
sport from its developmental features, the 'variable' approach completely ignores the 
socio-historical and political dimensions of cultural life. 

The specific contribution of cultural studies has been to link up the lived experience of 
human actors, and cultural meanings, texts, representations (culture as interpretation) with 
broader political and economic structures of modern industrial societies (Hollands 
1984:70-71). 


The existence of sport must be explained in terms of something more than 
simply the needs of the social system or the production needs of a capitalist 
economy. Sport is created by people interacting, using their skills and in- 
terests to make sport into something that meets their interests and needs 
(Coakley 1990). 

Change is also important to cultural studies theorists. Sport is “produced” 
out of everyday life (Gruneau 1983). Thus, people act to create their in- 
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stitutions; they are not simply passive responders. At times, however, the 
design of sport is not always desirable (e.g. athletes cannot always transfer 
Schools) because some of those involved are adversely affected, and they do 
not usually have sufficient resources to resist organizational change. For 
example, college athletes can do little or nothing about the way the NCAA 
runs college sports. Sport and any vestiges of culture can be forms of 
resistance (Fiske 1989, Foley 1990), although for the most part existing class 
relations, including those of gender and race, are reproduced (Foley 1990, 
Hargreaves 1986, Gruneau 1989, Adelman 1986). 


SPORT AND SOCIALIZATION 


The involvement of youth, particularly male children, in sport is typically 
encouraged by parents, school administrators, and community leaders be- 
cause this activity is viewed as a very effective setting for learning acceptable 
values and beliefs and for acquiring desirable character traits. Despite a 
barrage of criticism, very strong societal support exists for sport participation 
because of the belief that sport teaches proper values such as self-discipline, 
sportsmanship, and an appreciation for hard work, competition, and goal 
attainment. 

Most claims about the value of sport participation focus on sport as an agent 
of socialization. But despite strong cultural beliefs, there is little evidence to 
support the claims made for the contribution of sport to the socialization 
process. Studies comparing male athletes and male nonathletes (there are very 
few studies comparing women) yield little evidence to support the idea that 
sport is necessary for complete and adequate socialization, or that involve- 
ment in sport results in character building, moral development, a competitive 
or team orientation, good citizenship, or valued personality traits (Dubois 
1986, Fine 1987, Coakley 1987, McPherson et al 1989, Rees et al 1990). 
Sport seems to make little difference. Athletes and non-athletes are compara- 
ble on various personality traits and value orientations. Sport participation has 
no general effect on self-image; it does not reduce prejudice; it is not 
necessary for leadership development; and social adjustment is not necessarily 
enhanced (Fine 1987). The widespread conclusion by sport sociologists is that 
when an apparent socialization effect is found, it is actually the result of a 
selection process that attracts and retains children and youth in sport who 
already have or are comfortable with the values and behavioral traits that 
coaches demand and that lead to success in sport. Those without these desired 
values and traits either show no interest in sport, or they leave sport voluntari- 
ly (i.e. they drop out) or involuntarily (i.e. they are removed by coaches) 
(Stevenson 1975, 1985). 

The debate over the experience of youth in sport stems largely from cultural 
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myths rather than from empirical studies. The content is not so critical; the 
reaction and definitions of the child's significant others are what make a 
difference (Sherif 1976, Fine 1987). Continued participation is likely if 
support and perception are positive; participation will not continue if the 
socializing influence from significant others is absent. Withdrawal takes place 
if the sport role is no longer crucial to identity, and non-sport activities and 
peers become more attractive (Ball 1976, Brown 1985). Aversive socializa- 
tion or a dramatic negative experience can also stimulate withdrawal. Brown 
(1985) found that many swimmers, age 10-12, dropped out of the sport to 
protect their self-esteem in the face of declining success and reduced rewards 
for participation. 

Finally, the socialization of youth and adults, whether participants or not, 
occurs through the media’s presentation of sport. The mass media socialize 
through image management, the manipulation of symbols, and commentary 
(Prisuta 1979, Jhally 1989, Wenner 1989, Sage 1990). The mediated sports 
product that reflects conservative, authoritarian, and nationalistic values is 
warmly received by the audience; it is not forced upon them (Wenner 1989, 
Prisuta 1979). There is implicit political content in the form of value messages 
in the mediated presentation of sport. For example, Wenner's (1986) study of 
Super Bowl pregame commentary found that the value content strongly 
_ endorsed hard work, rugged individualism, and national pride. Media are in a 

sense the creators of culture, conveying information about what is acceptable 
and unacceptable. Thus, the media reinforce established order and value 
consensus by virtue of the presentation, by commentary and pictures of sport 
events. These media presentations can influence our ideas about sport, our 
perceptions of gender, race, social relations, and proper behaviors, and our 
adherence to certain values. 


SOCIAL CHANGE: FROM PLAY TO 
CORPORATE SPORT 


Although typologies are not intended to be accurate representations of reality, 
they are useful for analytical purposes. One such typology is that used to 
distinguish play from sport in the transformation of /udic, playful activity 
pursued for its own sake, to physical activity that is used for extrinsic 
purposes. Historically, this transformation has produced what Page (1973), 
Guttman (1988), and others have described as the secularization, com- 
modification, rationalization, bureaucratization, quantification, and com- 
mercialization of modern sport. 

Huizinga (1949), Stone (1955), Caillois (1961), Edwards (1973), McPher- 
son et al (1989), Coakley (1990), and many others who study sport have 
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found useful the analytic distinction between play and sport. Play is viewed as 
an activity where entry and exit are free and voluntary, rules are emergent and 
temporary, fantasy is permitted, utility of action is irrelevant, and the result is 
uncertain. Play has no formal history nor organization; motivation and satis- 
faction are intrinsic; and the outcome does not have serious impact beyond the 
context of the activity. On the other hand, modern sport as represented by the 
Olympic Games, big-time college athletics, and professional sports exhibits 
the opposite profile. This type of sport is hardly voluntary; rules are formal, 
generalizable, and enforced by formal regulatory bodies (e.g. National Col- 
legiate Athletic Association—NCAA); the outcome is serious for individuals 
and organizations not actually participating in the physical activity, and 
winning (the outcome) is more important than participation (the process). 

As sport becomes institutionalized, particularly at the highest levels of 
amateur and professional competition, it has come to reflect the corporate/ 
commodity model. Sport is more like work than play. The locus of control has 
moved from the player/participant to the manager and audience. Morality and 
ennoblement are replaced by spectacle and entertainment. Play is replaced by 
display (Stone 1955). Attracting spectators and media sponsorships becomes 
more important than the playing process because sport is now driven by profit 
and the market. The ethics of the business and corporate world tend to guide 
sport, not the principles of play and enjoyment (Eitzen & Sage 1989:16—18, 
Sage 1990). We should not be surprised that high-level sport has been 
transformed into a commercialized, commodified, and massified phenom- 
enon. Since the business organization has replaced the family as the basic unit 
coordinating economic activity, and monopoly capital has promoted con- 
sumer markets as the preeminent factor in economic organization, many 
institutions, including sport, in America have been rationalized and corpora- 
tized. The commercial factor is so prominent that even a Gross National 
Sports Product (GNSP) has been calculated. In 1988 the GNSP was $63.1 
billion (Comte & Stogel 1990). This places the sport GNP twenty-second on 
the list of the top 50 industry GNP, ahead of the automobile, petroleum, and 
airline industries. 

The consequences of the commercialization of sport are significant. First, 
changes may be made in the game format or rules. In football, for example, 
the forward pass and the narrowing of the hash marks were changes designed 
to make football more appealing to viewers and spectators. Second, the 
orientation or values of the participants may change from those based on 
self-development and satisfaction to those of entertainment and self-interest. 
Coakley (1990) has described this as a change from an aesthetic orientation 
that emphasizes the beauty and pleasure of movement, skill and ability, and 
lifelong activity to a heroic orientation that emphasizes danger and excite- 
ment, style, and a short-term commitment to victory. There is a developing 
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industry of sport sciences for the primary goal of performance enhancement. 
The emphasis is on strategies, technical improvement, nutritional and psycho- 
logical intervention, or any technique to manipulate or engineer the athlete to 
perform better. This has been called the "scientization" of athletic sport; it is a 
trend consistent with the instrumental goals of corporate sport (Brohm 1978). 
Third, control and influence are largely in the hands of persons and organiza- 
tions who are not direct participants. Leagues, regulatory groups like the 
NCAA, media, event sponsors, owners in professional sports, athletic de- 
partments and alumni in collegiate programs are examples of the centers of 
influence. This has forced professional athletes to take measures to enhance 
their own impact on decision-making; these include labor unionization, repre- 
sentation by agents, and recourse to legal challenges. The fourth consequence 
of commercialization is the decline of amateurism and the rise of professional- 
ism (Sewart 1985, Eitzen 1989). Elite amateur sport is corporate sport, 
whether it is related to the Olympic Games or to collegiate sports. True, 
amateur sport is characterized by participation for the love of the sport, not the 
extrinsic rewards. Intrinsic motivation and reward are still relevant, but their 
significance seems to be devalued by the rising importance of monetary 
rewards (legal and illegal) for athletes, the exaggerated importance of winning 
that translates into treating athletes as interchangeable parts, the increase in 
the incidence of athletes using performance enhancing drugs, and the associa- 
tion of athletic success with outcome goals of profit, visibility, entertainment, 
and community/organization prestige. 

The commercialization of sport is strongly influenced by the role of the 
media, particularly television, in programming the sports product and in 
the monies paid to sports organizations for broadcasting rights. The role of the 
media has increased rapidly in recent years. For instance, in 1980 the three 
major television networks broadcast live sports programming only 787 hours. 
In 1989, however, the three networks plus ESPN, SCA, and TBS channels 
broadcast 7341 hours of live sports programming (Stogel 1990:48), an in- 
crease of nearly 900% in the last decade. The influx of media money is 
obviously substantial. The National Football league, for example, sold rights 
to its games from 1990 through 1993 for a total of $3.6 billion, increasing the 
league's yearly income from television about 90% over 1989. 

The athletic establishment covets television coverage of its events because 
of the contribution the media make to profits and to exposure to a. mass 
audience. The media pursue sports programming because it is cheaper to 
produce than other types of programs .and it enables the media to reach a 
normally difficult-to-reach audience of young, college-educated males with 
disposable income (Coakley 1990:281). This makes sports programming 
attractive to certain advertisers. Thus, the media and sport emerge in a 
symbiotic relationship, particularly economically. It is important to under- 
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stand that the media play a dominant role because sport is primarily directed 
by commercial guidelines. 


Television simply expands the commercial interests that are already an inherent part of 
spectator sports in capitalist societies. Although some changes are uniquely linked to the 
special needs of television coverage, the real reason for most of the changes occurring over 
the past 3 decades has been the desire to produce more marketable entertainment for all 
spectators and a more attractive commercial package for sponsors and advertisers (Coakley 
1990:280). 


However, once a sports entity has been displayed on television and received 
the financial support from television, the sports organization is forever 
changed (Parente 1977, Altheide & Snow 1978). 

The nature of sport has been changed by the media with its emphasis on 
display or what has been called "entertainmentization." Many changes in the 
nature of sport have resulted from media's influence and the desire of both 
media entrepreneurs and representatives of the sport establishment to enhance 
the appeal of their sport product to maintain profit margins. Thus, the media 
sometimes glorify violence, create heros and heroines, demand that athletic 
schedules be shifted, and that rule changes be made to enhance the product, 
increase the audience, and generate greater revenues from advertisers. 

The preeminent role of media in sport means that TV media's presenta- 
tion of sport is “mediated.” That is, viewers see a representation of sport as 
it is depicted by commentary, shot or scene selection, and editorialization 
(Comisky et al 1977, Coakley 1990). Because the public rarely questions 
the media's construction of reality, particularly that treated by the press 
and television, the media play a role in shaping images and opinions re- 
garding sport (Greendorfer 1983, Tuchman 1978). The media's representa- 
tion of an event is taken as reality, overlooking the reality that it is a staged 
event mediated by commentary. What is presented is generally consistent 
with the commercial and entertainment agendas of media and sport establish- 
ment. 

One of the sport-related behaviors that has been projected as a result of 
media's definition of sport is gambling. Not only do media outlets provide the 
information a gambler needs, they also are a source of legitimation of 
gambling. The inclusion of point-spreads and injury reports in descriptions of 
upcoming games and the promotion of gambling celebrities and analysts 
together communicate the message that gambling is an acceptable activity and 
that gambling on sport is a natural accompaniment of sport, even though it is 
legal in only two states. Thus, the symbiotic relationship of sport and the 
media is enhanced by the provision of gambling information that appeals to 
viewers and readers (D'Angelo 1987). 

As an activity, gambling is widespread and essentially a legitimate activity 
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to most (Rosecrance 1988). The dated (1974) but very comprehensive study 
of America's gambling behavior reported that just under two thirds of Amer- 
icans had gambled in the previous twelve months. Over 40 million persons 
had bet on sport. A 1984 Gallup Poll and the 1983 Miller Lite Sport Survey 
reported, respectively, that 17% and 23% of the population bet on sport (Frey 
1985). In 1989, Americans gambled $290 billion on all types of games, an 
increase of nearly 100% from 1982. Of this amount $43 billion was bet 
illegally, and nearly 70% of that was bet on sports. Sports betting in 1989 
represents approximately 11% of all gambling, an increase of 42% over 1982 
(Christianson 1990:8). Sports betting is very popular, and this popularity is 
promoted by the fact that the natural association of sport and gambling is 
frequently part of the content of the media's presentation of and commentary 
on sport events. 


SPORT AND THE POLITY: INTERNATIONAL 
RELATIONS 


Two major motives govern a country's political and economic activity. The 
first is that of building an efficient but dynamic modern state. In other words, 
the country should exhibit an acceptable standard of living, a stable political 
order, an equitable system of social justice, and social behavior governed by 
controlled, rational, not impulsive, procedures. The second national motive is 
to be recognized as a responsible actor in the international community of 
nations (Geertz 1963). In other words, nations: seek internal stability and 
external status. Sport is commonly viewed as a vehicle to achieve both. 
The integrative role of sport for a nation is similar to the structural- 
functional role assigned to sport in communities, in the classic studies by the 
Lynds (1929), Hollingshead (1949) and Stone (1981) and the recent study by 
Wilkerson & Dodder (1987). Integration to counteract internal racial, ethnic, 
regional and class diversity and conflict is a major reason that nations promote 
participation in highly visible international events. Lever's study of soccer in 
Brazil documented the way that sport gave a diverse population something to 
share in the name of national solidarity (1983:19). Thus, sport can contribute 
to a national identity or sense of nationalism that temporarily overrides 
differences. The world rugby championships provide an interesting example. 
Annually when national teams are selected, the warring factions in Ireland 
and Northern Ireland lay aside their bitter animosities, combining players to 
form the Irish national team which members of both nations cheer. Gov- 
ernments will often encourage international competition in acknowledgment 
of the unifying role of sport (Anthony 1969, Frey 1984, Riordan 1986). 
However, critics employing a conflict perspective assert that the use of sport 
to encourage the homogenization of a population represents an effort to 
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control that population by instilling certain dominant values, which reduce the 
likelihood of a challenge against those in power (Klein 1989). 

Involvement in international sport requires participation in networks of 
organizations that are transnational in scope. Before this participation can be 
effective, an organizational base staffed by competent personnel must be 
developed. Sport provides a mechanism to link organizations and to develop 
managerial and administrative skills among the local population. Several 
Latin American countries, including Cuba, have done this (Arbena 1985). 
Nicaragua used baseball as a mechanism to rebuild internal institutions after a 
revolution and to reinstitute regional and intranational communication and 
coordination. In some cases, Cuba and the Dominican Republic, for example, 
the use of sport to enhance the human capital of an indigenous population 
serves to reduce the dependency of a nation of smaller size and limited 
resources on the larger nations (Frey 1988, Klein 1989). 

The primary role of sport in international relations is one of public di- 
plomacy. That is, sport serves to articulate secondary national interests (e.g. 
visibility, ideological expression, status enhancement, legitimacy), to test 
foreign relations initiatives (e.g. ping-pong diplomacy between the United 
States and China), to enhance cultural exchange and understanding, and to 
reduce the potential for actual conflict by playing out hostilities in a restricted 
and controlled setting (Reich 1974, Frey 1984, Riordan 1986). Thus, sport is 
an activity of international cooperation and interaction, but it is peripheral to 
the survival of a state political system (Frey 1984:72). Olympic record 
performances are irrelevant to hard-core negotiations over primary national 
interests that if unresolved could lead to war. 

Nevertheless, status in the community of nations is ultimately related to 
success in athletic events. The gold medal count in the Olympics is important 
precisely because that count becomes a measure of political legitimacy, of 
modernization, or of a people's resolve (Espy 1979, Frey 1984, Heinila 
1985). 

The motivation to achieve acceptable status in a community of nations and 
thus to translate this status into political advantage is not limited to developing 
countries. Eastern bloc nations have spent enormous resources to achieve 
success in international competitions. The United States and other western 
nations have been reluctant to admit their keen interest in international sports 
success, but it is there nonetheless. International sports success is a very 
serious goal in the United States, for success is interpreted internally and 
externally as "proof" of the superiority of a nation's social, economic, and 
political systems. Thus, the only difference between the United States and the 
nations of the Eastern bloc and the Third World is that the United States does 
not admit that its international sporting efforts are serious (Frey 1984, Riordan 
1986). 
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SPORT AND INEQUALITY: RACE 


American sport sociologists have devoted considerable attention to the ex- 
amination of racial discrimination in sport. The major conclusion of this 
work, devoted for the most part to comparing whites and blacks, is that just as 
racial discrimination exists in society, it exists in sport. Blacks do not have 
equal opportunity; they do not receive similar rewards for equal performance 
when compared to whites; and their prospects for a lucrative career beyond 
sport participation are dismal. 

Americans remain comfortable with the cultural myth that the United States 
is an open society and that athletic excellence is an avenue of upward social 
mobility. Black subcultures reinforce this belief (Edwards 1984). The myth 
seemingly is supported by two facts. First, although blacks make up 11-12% 
of the US population, they are vastly overrepresented in sport, with (1988 
data) blacks comprising 21% of major league baseball players, 57% of 
professional football players, and 73% of professional basketball players. 
Second, many black athletes receive very high pay, some the highest in their 
sport. Research by sport sociologists, however, provides irrefutable evidence 
that blacks are the objects of discrimination in sport. 

To begin, while blacks are overrepresented in some sports, they are un- 
derrepresented in others. Phillips (1976) argues that the reason blacks excel 
and are disproportionately found in some sports lies in what he calls the 
"sports opportunity structure." Blacks are found in those sports in which 
facilities, coaching, and competition are available to them: in the schools and 
community recreation programs. They are rarely found, however, in those 
sports that require the facilities, coaching, and competition usually provided 
only in private clubs or that are otherwise too expensive or exclusive to 
obtain. 

Once blacks enter the ranks of sport, at whatever level or whatever sport, 
discriminatory practices continue. These take three forms: position allocation, 
analysis of performance differentials, and reward structures. One of the best 
documented forms of discrimination at both the college and professional 
levels is popularly known as "stacking." This term refers to situations in 
which minority group members are relegated to specific team positions and 
typically excluded from competing for others. Thus, sport reproduces the race 
relations found in society. Blacks tend to be "stacked" in those team positions 
that match racial and ethnic stereotypes, that is, they are placed into positions 
that require physical skills while whites are disproportionately found in 
positions that require intelligence, leadership, and that have greater outcome 
control. 

In an early analysis Grusky (1963) observed that the importance of a 
position in an organization depends upon that position's spatial location or 
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relevance to outcome, and the degree of interaction associated with that 
position. He said the more central one's spatial location: (a) the greater the 
likelihood dependent or coordinative tasks will be performed; and (b) the 
greater the rate of interaction with the occupants of other positions (Grusky 
1963:345—46). Centrality has become a significant concept in analyzing the 
allocation of blacks and whites to positions on sport teams (Loy & McElvogue 
1970). Racial segregation in sport is positively related to position centrality. 
Whites are typically found in central positions that require intelligence, 
coordinative and decision-making skills, and high rates of interaction, and 
thus greater outcome control— positions such as offensive center, quar- 
terback, and middle linebacker in football, pitcher and catcher in baseball, 
and point guard in basketball. Blacks are channeled to noncentral positions 
that require physical skill such as speed and quickness, less interaction, and 
that have less impact on outcome—positions such as running back, receiver, 
defensive line, and corner back in football, the outfield in baseball, and 
forward in basketball (Loy & McElvogue 1970, Scully 1974, Yetman & 
Eitzen 1984, Leonard 1988). 

Since Loy & McElvogue first noted the stacking phenomenon in team 
sports (1970), research ongoing to the present finds these patterns for college 
sport (Schneider & Eitzen 1979) and women's sport (Eitzen & Furst 1989), as 
well as for professional sport, although it is no longer found in professional 
basketball where blacks have such a numerical superiority. Moreover, this 
pattern of racial stacking has also been found in other sports such as rugby, 
soccer, cricket, and hockey in other societies such as Canada, England, and 
Australia (Best 1987, Maguire 1988, Lapchick 1989, Lavoie 1989). 

Biological explanations of stacking, which are reflections of racial 
stereotypes, have been refuted (Eitzen & Sanford, 1975). Another explana- 
tion focuses on modeling, arguing that blacks choose to play certain positions 
because they desire to emulate role models (McPherson 19752). This makes 
the exclusionary system self-perpetuating since most role models tend to fill 
noncentral positions. Economic explanations assert that the economic costs of 
developing play skills at certain positions such as quarterback are greater than 
for others (e.g. outfield); because blacks occupy lower socioeconomic status 
in American society, they will choose to play noncentral positions where the 
development of required skills is less costly. As black education improves, 
the proportion found in central positions will improve (Medoff 1986). Pre- 
sumably, improving SES is the result of better education and improved 
experience in decision making and developing intellectual skills. However, 
no evidence supports this trend in the broader society (Yetman 1987, Phillips 
1988). In fact, black economic income, defined as a percentage of white 
income, has declined over the years. 

Moreover, the economic hypothesis presumes that blacks will enhance their 
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human capital through educational achievement. It is well documented, 
however, that on every measure of educational progress blacks score less well 
than whites. Eitzen & Purdy's study (1986) of college recruiting confirmed 
that sports tend to recruit the academically marginal blacks, thereby project- 
ing a higher failure rate. Black athletes are exploited for their skill and given a 
scholarship, but they receive no education. The higher academic failure rate 
perpetuates the myth that blacks have superior physical skills, but inferior 
intellectual skills. 

Another mechanism of discrimination is called the "the unequal opportu- 
nity for equal ability" hypothesis. Blacks must be better than whites to be 
admitted to college and to remain on athletic teams. Research has shown 
consistently that blacks are disproportionately found as starters and "stars" on 
the team, whereas whites are found disproportionately as nonstarters. After an 
elaborate study of baseball, Scully concluded, "Not only do blacks have to 
outperform whites to get into baseball, but they must consistently outperform 
them over their playing careers in order to stay in baseball" (Scully 
1974:263). In Jonathan Brower's words, "mediocrity js a white luxury" 
(1973). This interesting relationship has been found in professional baseball 
and basketball, where the detailed statistics in those sports make such studies 
possible. This pattern also exists at the college level, where it is manifested in 
two additional ways. First, blacks must exhibit higher athletic skills than most 
of their white teammates in order to receive a scholarship (Evans 1979). And, 
second, blacks are more likely than whites to be recruited from community 
colleges, which means that universities make a relatively smaller investment 
in blacks, and that the universities are relatively assured of getting athletes 
with proven athletic abilities (Tolbert 1975). 

Blacks are underrepresented in sports journalism, in officiating, and in 
sports administrative positions (Lapchick, 1990). Most visible is the paucity 
of blacks as head coaches or managers. Black women who aspire to coaching : 
and management positions are victims of double jeopardy—their race and 
their gender. The lack of black coaches and managers is likely the result of 
two forms of discrimination. Overt discrimination occurs when owners ignore 
competent blacks because of their prejudices or because they fear the negative 
reaction of fans to blacks in leadership positions. The other form of dis- 
crimination is more subtle. Blacks are not considered for coaching positions 
because they did not, during their playing days, play at positions requiring 
leadership and decision-making due to stacking. 

Most Americans believe that participation in sport is a mechanism to 
improve race relations (Miller Lite 1983). This view is an expression of the 
contact hypothesis which suggests that exposure to other racial groups 
through interaction reduces prejudice. Although one study (McClendon & 
Eitzen 1975) has found some support for this hypothesis in limited situations 
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(when both races on a team contribute to winning and the team is successful), 
most research does not find that interracial contact in sport reduces racial 
prejudice (Chu & Griffey 1989, Lapchick 1989, McPherson et al 1989). 
The history of desegregation in American sport shows clearly that owners 
and coaches integrated teams when they realized that winning leads to profits 
and that skin color does not matter if teams win (Coakley 1990:209). Thus, 
the commercial interests of those who control sport override cultural views. 
This is substantiated further by the finding that attendance at sports events are 
not responsive to the racial composition of teams (Koch & Vander Hill 1988). 


SPORT AND INEQUALITY: GENDER 


Most of what is known about sport is based on studies of white males. Sport 
and the values associated with sport have traditionally been those relevant to 
males not females. Any research that did include gender typically assumed 
that there was an inherent conflict between being a woman and participating 
in Sport (Hall 1988). Thus, sport has tended to celebrate the achievements of 
men while marginalyzing the status of women by placing women in ex- 
pressive, supportive roles such as cheerleaders, or relegating participation by 
women to a secondary status. Sport has been as a result largely a “male 
preserve" supported by institutional practices of discrimination against 
women. 

In a manner similar to the experiences of blacks in sport women have been 
prohibited from full and equal participation because of formal restrictions and 
cultural predispositions. 

Several myths have evolved with respect to the participation of women in 
sport. These include the idea that sport is harmful to the female reproductive 
system and thus a threat to child bearing; that sport masculinizes a female in 
appearance; that the development of male masculinity is threatened if girls 
outperform adolescent boys in sport; that human and economic resources are 
wasted because the performance levels of females are significantly lower than 
those of males; and that sport is not important for the social development of 
women because the values of achievement, aggressiveness, and competition 
are irrelevant to the life experience of women. Even though all of the above 
have been refuted by evidence (for a summary, see McPherson et al 1989), 
these myths remain influential, causing stigma and role conflict for some 
women athletes. Although women athletes may experience role conflict, 
research shows that for most it is a positive experience. Data comparing 
women athletes and non-athletes from the United States, Australia, and India 
suggest, for example, that women athletes have a better self-image, a better 
body image, and a better outlook toward life than nonathletes (Snyder & 
Kivlin 1975). However, these results could be the product of selection, not 
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socialization, and the outcome of strong in-group bonds formed by a sub- 
culture of athletes who acknowledge their differences from most women. 

The most serious problems women face in sport involve discrimination. 
Title IX or Affirmative Action Legislation (1972) was designed to rectify 
discrimination, particularly in access to facilities, financial aid, and opportu- 
nity for participation. Some improvement was made, but when the US 
Supreme Court ruled in Grove City v. Bell (1984) that Title IX language 
applied only to programs receiving federal funds, progress was stalemated, 
with some 800 federal investigations involving possible sexist practices 
dropped or narrowed (Sabo 1988). The original broader interpretation of Title 
IX was restored by the Civil Rights Restoration Act of 1988, but the pace of 
investigations has not resumed, since it depends on the will of the Justice 
Department. 

A dramatic example of discrimination by denying access to the control of 
sport is found in Acosta & Carpenter's (unpublished) analysis of the participa- 
tion of women in the administration of athletic programs. Since Title IX, 
schools are offering more sports for women, and participation by women as 
athletes has shown a dramatic increase. However, the proportion of women 
who are coaches or admininstrators has declined significantly. In 1972, 90% 
of women's teams were coached by women; in 1989, 4795 were coached by 
women. Similarly, the non-coach administrators of women's sports programs 
tend to be men, and the few women administrators in place are supervised by 
men. 

The discrimination against women in sport has been documented in many 
areas and continues. The greatest promise of scholarship in this area, howev- 
er, is theoretical. Feminist social theorists assert that sport is one of many 
social systems dominated by patriarchal value systems and dominance pat- 
terns (Birrell 1984, Hall 1988). The starting point of feminist social theory is 
the understanding of the dominance patterns found in patriarchal social 
structures (Hall 1984:88). Since sport is a cultural form, emergent, changing, 
and subject to the influence of gender and class, the meanings attached to 
sport participation cannot be properly assessed with traditional empirical 
methods. In the view of most feminist theorists, a feminist view must be 
cultural, humanist, interpretive, phenomenological, and value oriented. Thus, 
there is an affinity for the incorporation of the cultural studies approach by 
feminist scholars. 

An interesting variant of gender scholarship and theoretical development 
has been through the study of masculinity and sport. Two of these de- 
velopments appear to be especially fruitful (Messner 1990). Some pro- 
feminist, male scholars have used their own biographies to understand mascu- 
line worlds from a feminine viewpoint that emphasizes institutional patriarchy 
(Sabo 1986, Kimmel 1987). Thus, the emphasis is on the links between the 
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costs and privileges of masculinity. The other approach is called “inclusive 
feminism." Messner, a leading proponent of this approach, summarizes: 


Through an inclusive feminism that recognizes the importance of working from multiple 
standpoints, we can begin to build an understanding of how class, racial, and sexual 
struggles within hierarchies of intermale dominance serve to construct men's global 
subordination of women (1990:149). 


Just as scholars in other subfields in sociology, scholars in the sociology of 
sport are beginning to describe dominance and subordination and to analyze 
their complexity as they are simultaneously structured along racial, gender, 
and class lines (Birrell 1989). 


CONCLUSION 


Over a decade ago Gunther Luschen summarized the status of sport sociology 
as a subdiscipline of sociology in volume 6 of the Annual Review of Sociology 
(Luschen 1980). He asserted that sport sociology had garnered some follow- 
ers who published sport-related articles in over 100 different scholarly jour- 
nals. It appeared that sport sociology was on the verge of expanding as a field 
of social inquiry and of gaining acceptance in mainstream sociology. This 
promise, however, has not been realized. 

Even with the formation of the North American Society for the Sociology 
of Sport (NASSS) and the publication of its Sociology of Sport Journal, sport 
sociology remains somewhat of an orphan speciality. The critical mass of 
theorists and researchers required to promote collaborative efforts, network 
formation, and professional identity has not emerged (Kenyon 1986, Coakley 
1987). Even with the routine inclusion of sport sociology sessions on region- 
al, national, and international conference programs, the profession of sociolo- 
gy has not accepted the study of sport into its mainstream. Neither has sport 
sociology attained high status in physical education where there are more 
physical educators claiming sport sociology as an area of study than there are 
such members of the American Sociological Association. Few graduate pro- 
grams are available, and fewer courses are found in standard undergraduate 
curriculums. Sport research is often an "after-thought," pursued as an aca- 
demic interest only after "serious" work is done. 

The most serious charge against sport sociology is that the theoretical 
development is relatively weak (Luschen 1980, Kenyon 1986, Coakley 1987, 
MacAloon 1987). Coakley summarizes this criticism: 


Unfortunately, much of the research in sociology of sport has been neither cumulative or 
theory-based, nor has it been dedicated to theory development (Kenyon 1986). More often, 
research has been designed to describe sport in ways that call popular beliefs into question, 
or to document the existence for an issue or problem. This is true of the field as a whole, 
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but it is especially true of work done in the U.S. This is not to say that theory has not 
informed some of the work done by American sport sociologists, but little of their research 
has grown directly out of concern for theory testing or theory development in sociology 
(1987:14). 


As a subdiscipline the sociology of sport is only 25 years old, and its 
professional association has been in existence just over a decade. This sub- 
discipline has the same problems found in the rest of sociology. While much 
of the research in sport sociology is not guided by theory, some very impor- 
tant work is. Leaders in the field are calling for theoretically based studies 
(see, Kenyon 1986, Birrell 1989), and some exemplars are leading the way. 
The work of gender theorists appears especially promising not only for the 
sport sociology but for sociology in general. This is because sport is such a 
fruitful arena in which "to take into account the contours of the particular 
relations of dominance and subordination that exist among groups located at 


the intersection of class and racial conflicts" (Birrell 1989:221). 
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Abstract 


Comparative work on the state, in terms of both new cases and general 
theoretical frameworks, has proliferated in the last decade. In this new work, 
traditional categories of differentiation such as structural-functionalism, 
Marxism, and pluralism have lost relevance and have been replaced by 
common conceptual strands that infuse every subfield of research on the state. 
In this review we select the most important conceptual strands and examine 
new research in state formation, the state's role in economic development, 
and the state's relation to social movements and ethnic identity formation. 
Despite the frequent criticism that the state is the only explanatory variable in 
these studies, we find that the literature has always recognized that state- 
society relations are critical to understanding state action. 


INTRODUCTION 


The last few years have seen a tremendous boom in the study of the state 
within the fields of sociology and political science. Scholars ranging from 
comparative historical sociologists to specialists in the economics of develop- 
ment have begun to pay attention to the role of the state, primarily in the 
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context of Western Europe and North America. Using the state as a central 
concept, they have developed categories to explain its role in social, eco- 
nomic, and political change. Their influence has been far-reaching; for ex- 
ample, scholars of developing countries have always been aware of the 
importance of state institutions, but they are increasingly adopting the 
categories used by the new state-centered theorists. 

The study of the state is not new in sociological and political analysis. 
Rather, the focus is new. As Gabriel Almond argues, a segment of the 
political science literature has always incorporated the state in its analyses 
(1988). In support of this claim Almond cites the work of such eminent 
political scientists as David Truman, E. E. Schattschneider, V. O. Key, and 
Pendleton Herring, among others. But either these writers tend to focus on 
government conceived of as a collection of individuals performing specific 
functions, or they limit themselves to circumscribed institutions within the 
state, such as the Presidency or Congress. They rarely view the state as an 
administrative apparatus where administration means the extraction of re- 
sources, control and coercion, and maintenance of the political, legal, and 
normative order in society. In the new focus on the state, this latter set of 
activities predominates. It reveals a move away not only from the pluralist but 
also from the Marxist perspectives of the state toward a more Weberian 
framework of analysis. Pluralist and Marxist categories such as classes and 
interest groups have been retained to describe society, but the state is being 
reconceptualized almost entirely around Weber's definition. 

In the introduction to Bringing the State Back In (Evans et al 1985), one of 
the most widely cited and probably most influential works in the new statist 
approach, Theda Skocpol characterizes prior comparative research as being 
excessively society-centered. As a result of this bias, the state became merely 
the arena in which social conflicts are waged. To accord the state its rightful 
place in social science research, Skocpol argues for “a fundamental rethinking 
of the role of states in relation to economies and societies" (1985:7), and she 
points to a growing body of work that treats the state as a central explanatory 
variable. In her essay Skocpol is careful to note that "Studies of states alone 
are not to be substituted for concerns with classes or groups; nor are purely 
state-determinist arguments to be fashioned in the place of society-centered 
explanations (1985:20). Nevertheless, her argument is often construed as 
dichotomizing the literature into state-centered and society-centered ap- 
proaches. Even if this dichotomy was applicable in the past, it seems less 
relevant today, when the state is almost always part of the explanation. In its 
place we offer a tripartite division of the state-society literature. 

Early research within this statist perspective was still laying the ground- 
work of the new approach, and therefore scholars initially emphasized the 
state as the central explanatory variable (Block 1977, Trimberger 1978, 
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Skocpol 1979, Levi 1981). They advanced a number of new arguments. First, 
they conceived of the state as an actor with interests of its own which do not 
necessarily reflect those of society. In this approach's ideal case of a strong 
state, autonomy from society was assumed to endow the state with the 
capacity to act to fulfill its interests. At the same time as these initial 
state-centered views set the new intellectual agenda, a parallel strand of 
empirically driven research was drawing on empirical cases to articulate a 
more moderate vision of the state's role by embedding it in its societal context 
(Stepan 1978, Evans 1979, Rueschemeyer & Evans 1985, Katzenstein 1985). 
In the ideal case of this approach, the state is still strong, but it is acting in 
relation to society, and analysis of state action has to be analyzed within 
specific socioeconomic and sociocultural contexts. Finally, a third avenue of 
research accepted a central role for the state but argued that the state's relation 
to society is essentially a contested one (Callaghy 1984a,b, Kohli 1986, 
Azarya & Chazan 1987, Migdal 1988). Ín this research the ideal state 
triumphs in its struggle with society and develops autonomy and capacity for 
action as a final result of a compact between itself and societal forces. By 
contrast, unsuccessful states are constrained or even dominated by their social 
context and are unable to act independently. The strength of this latter 
approach rested in its balanced treatment of state and society; it assumed no a 
priori bias in favor of the capabilities of each. In addition, it had the advantage 
of cutting across the standard developed/developing dichotomy that character- 
izes comparative research. 

In all three perspectives, several dimensions are always present. A survey 
of the recent literature reveals that the state in empirical research is op- 
erationalized in terms of specific institutions and actors. Thus, for example, 
the state of Argentina in the late 1970s is coterminous with the military, while 
in Zaire, Mobutu's actions are characterized as state actions. Nevertheless, 
the review format challenges us to reveal the underlying similarities among 
different approaches. In this literature, different institutions are studied with 
the justification that they shed light on the state. Therefore, our use of “the 
state" emphasizes the theoretical similarities over the empirical differences. 

One of the dimensions applied least consistently is that of state strength. 
State strength is usually some (nonadditive) combination of state autonomy 
and capacity (Evans et al 1985, Migdal 1988), and it can include both 
infrastructural and despotic power (Mann 1986b). Perhaps because the con- 
cepts of autonomy and capacity are so closely related, they are often collapsed 
together in state analyses. However, we maintain that it is important to 
recognize analytical distinctions between them and then to study the manner 
in which the two concepts interact. Autonomy refers to the state's ability to 
formulate interests of its own, independent of or against the will of divergent 
societal interests. Autonomy is not necessarily a characteristic of the state in 
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its totality; a state can be autonomous in certain domains and dependent in 
others. Capacity is defined here as the state's ability to implement strategies to 
achieve its economic, political, or social goals in society. The state may 
acquire capacity through institutions such as the bureaucracy, or through 
resources such as external ties to entrepreneurs and finance capital (Evans 
1989). Although they are invariably discussed as attributes of the state, 
autonomy and capacity are not merely a function of state organization and 
resources; they are also determined by the state's relation to society. Re- 
sources, organization, and state-society relations are constantly interacting 
with each other; these interactions in turn reshape the nature of state autonomy 
and capacity. Finally, the distinction between infrastructural and despotic 
power is adopted to refine the categories of state strength and capacity. States 
with despotic power are often characterized as strong, but their levels of 
autonomy and capacity are highly variable. By contrast, states with in- 
frastructural power enhance their capacities to carry out their goals. 

In this essay we examine recent research on the state within three major 
areas of inquiry, chosen because they represent important theoretical ques- 
tions that have been applied to both western and nonwestern contexts. First 
we discuss recent additions to the research on state formation, reformulating 
the central tenets of that framework with an explicit state-society focus. We 
turn then to an extended discussion of research on the role of the state in 
economic development, paying close attention to the relationship between 
autonomy, capacity, and state action across states with diverse economic 
programs. We then examine the increasing scholarly attention to the influence 
of the state on social movements, ethnic mobilization, and ethnic identity, 
considering both the effects of state action on collective behavior and the 
state's own interest in shaping society along these dimensions. We conclude 
by suggesting avenues for further research, both generally within the field of 
comparative studies and more specifically in the areas to which we have 
devoted special attention. 


STATE FORMATION IN THE WESTERN WORLD 


The literature on state formation is well-established in sociology and political 
science. It is richly informed by historical research and theoretically sophisti- 
cated. Many of the categories were formulated in the 1970s, and newer 
contributions tend to be assessed either by how well they refine specific 
aspects of a major theoretical approach or by how they extend and define our 
understanding of the parameters of these categories. Therefore, the review 
here follows established categories and discusses the contribution of the 
recent scholarship to these. We defend the premise that the state formation 
literature, despite its theoretical formulation by sociologists and political 
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scientists who claimed a primordial role for the state in socioeconomic and 
political development, never dared conceptualize the state as entirely outside 
of or separate from society. Even those scholars generally considered to be 
strong "statists" conceptualized the state as one of the many alternative 
organizations within society in medieval Europe. To be sure, it was the one 
institution that managed to use an organizational advantage to its own benefit, 
to exercise control and domination over society, but the state did not achieve 
this without a struggle. Whether these formulations focused on the impact of 
external forces and war making, internal organization, or cultural dimensions, 
they all were formulated within the context of state-society relations and 
state-society struggles in particular. Furthermore, arguments within all three 
of these categories are historically specific arguments stressing the ex- 
traordinary conjunction of events that brought about states in late medieval 
Europe. 


Warfare and State-Making 


A variety of political units and organizations contended for power and control 
within the confines of medieval European society; these included feudal 
kingships, empires, the church, and free cities. State-makers who were 
ultimately successful at building centralized, differentiated organizations with 
a monopoly of coercion over a defined territory were those who undid and 
then redid the structure of society according to their own agendas. In the 
process, they encountered resistance from the different segments of the 
populations they attempted to subjugate and tax as well as from the existing 
alternative organizations (Tilly 1975, 1981, 1984). 

The need for the imposition of this new political order did not come from 
within society. Instead, it was the result of external conflicts, long struggles 
between protostates, warrior houses, and other diverse political units in the 
international context of incessant warfare (Hintze 1975, Tilly 1975, Finer 
1975, Mann 1986a, 1988, Zolberg 1986). Elias in particular stresses the 
struggles between two warrior houses, the Capetians and their successors as 
Kings of France and the Plantagenet Kings of England, a struggle that formed 
the two states but that also initiated the process through which Europe was 
transformed into a system of states (Elias 1982, Strayer 1970; Zolberg 1986). 
This focus on international conflict and war-making as the main mechanisms 
spearheading state formation has led to numerous studies of warfare, military 
organization, police forces, and the necessary extractive apparatus that must 
be developed to maintain such expenditure (Tilly 1975, Ardant 1975, Finer 
1975, Mann 1986a, 1988). 

States that fought wars, built standing armies and required resources for 
state-building activities directly confronted societies that often were unwilling 
to help in this endeavor. The resulting state formation often was the product of 
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bargaining, cooptation, legitimation, and sheer coercion between state- 
makers and societal forces, with society as the losers in the struggle being 
forced to pay (Tilly 1975, 1981). In their analysis of these relations some 
scholars emphasized the military aspect in the conscription of the population 
(Hintze 1975), while others developed the more economic aspects of the 
bargaining between states and different societal classes, especially merchants 
and aristocrats (Lane 1958, 1979). Tilly combines these two aspects in his 
study of European state makers who threaten war unless "protection" money 
is paid (1985). Lachmann has presented the economic aspects of state forma- 
tion and state society bargaining relations in his review of the origins of 
capitalism and its various interpretations (1989). State-society relations are of 
primordial importance in this literature even if they are not specifically 
discussed, since states carve out their organization and their domination from 
within society. Therefore, it is inconceivable to separate the two. Nonethe- 
less, while both state and society are crucial actors, states are dominant. 
States are either strong or getting stronger; they are autonomous vis-à-vis 
Society; and they are capable of imposing their will on society. In the 
conventional argument, states acquire their autonomy in the context of geopo- 
litical conflict from the development of their extractive apparatus and the 
consequent taxation and coercion of society (McNeill 1982, Tilly 1975, 
Ardant 1975). In Mann's modification of this argument the state gains 
autonomy in the same way, but this autonomy is wrested from "sociospatial 
networks of power which overlap and intersect" that he considers to be more 
heterogeneous and diffuse than society. It is in the process of asserting control 
that the state restructures, imposes order upon, and sets the boundaries around 
society (Mann 19862). These arguments have also been extended through the 
use of rational choice frameworks to consider states as predatory rational 
actors engaged in the maximizing their power and wealth as well as their 
territorial and administrative well-being (Levi 1988, Mann 1986b, 1988, 
Brewer 1989). 

In addition to arguing that state-society relations are crucial to the develop- 
ment of the state, recent scholarship has asserted that the nature of state- 
society relations will largely determine the types of states that emerge. In 
Mann's conception of medieval Europe states were weak and set within what 
he characterizes as an "acephalous federation of a multiplicity of power 
networks" and a variety of social groupings, which exercised varying degrees 
of local autonomy. When states grew dependent on populations for crucical 
resources, they were forced to develop symbiotic relations with the latter. It is 
from the nature of choices allowed by this symbiotic relation that two 
different types of European states emerge: the French model, which is de- 
pendent on the mobilized alternative where the state develops as a large, 
absolutist, centralized military, manpower administration; and the fiscal 
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alternative of the English state which developed a constitutional state (Mann 
19862). Brewer's analysis of the taxation system and the English state's 
ability to extract quick loans from societal forces provides evidence to cor- 
roborate this point (1989). This research raises further questions about Zol- 
berg's reliance on international conflict as the main explanatory variable in 
the differences between the French and the British state (1986). Moreover, 
both Mann's and Brewer's recent analyses have greatly discredited the all- 
too-frequent characterization of the French state as strong and the English as 
weaker (Anderson 1974, Badie & Birnbaum 1983, Birnbaum 1982). They 
demonstrate convincingly that the English state had a tremendous capacity for 
extracting considerable sums in taxation and loans, a type of capacity which 
Mann attributes to the British state's infrastructural power over society (Mann 
1986a,b, 1988, Brewer 1989). The English state was therefore quite strong, 
while the French state had a more conventional, despotic kind of power based 
on the extraction of resources through coercion. 


Societal Crises and State-Making 


The development of the state according to this argument is also a western 
phenomenon—the state primarily developed to maintain social integration. 
According to Badie & Birnbaum, the state is “a unique social invention 
devised to solve the specific crises of the western European societies at a 
particular point in their development" (1983). The role ascribed to the state 
differs according to its degree of autonomy. One argument emphasizes the 
internal crisis of feudalism as the propeller of state formation. In the Marxist 
tradition, the emergent absolutist state is an arm of the aristocracy and 
therefore has only relative autonomy (Anderson 1974). Another strand sees 
the emergent state as an autonomous structure that came into being during the 
feudal crisis. While it entered into social and political bargaining with the 
dominant class of the era, it tried at the same time to dominate and destroy 
that class (Badie & Birnbaum 1983). Finally, the administrative structure of 
the state developed according to this internal crisis logic; the state developed 
judicial, administrative, and financial structures first, then military warfare 
machines and diplomatic apparatuses (Strayer 1970). 

Most scholars within this perspective have a somewhat developmental view 
of state society relations, where the state first differentiates itself from society 
and then, after consolidating its position, furthers the goals of the system 
(Badie & Birnbaum 1983, Bimbaum 1982). In the intimate relationship 
between state and society, a weak state will invite a civil society that 
coordinates itself, while a strong state will assert its independence from the 
dominant classes. The English and US cases represent weak states, while 
France represents the strong state defined and based on the existence of a 
“self-reproducing, meritocratically recruited bureaucracy” (Badie & Birn- 
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baum 1983, Birnbaum 1982). The French state was not always strong; it 
alternated between no autonomy and internal cohesion and autonomy; but 
once it established its internal coherence through homogeneity, principles of 
meritocracy, and stable elite recruitment it could assert its independence 
vis-à-vis society. It is therefore the development of state capacity and the 
consequent institutionalization of the French state that brought about its 
autonomy (Birnbaum 1982). The historical analysis of American state build- 
ing represents a variant of this approach. Here, rather than society needing the 
state, it is capitalism that needs the state. In the development that follows, the 
state is constrained by preexisting political institutions. A convincing argu- 
ment is made that the state here is weaker since society was organized prior to 
the state; universal suffrage and participatory politics developed before state 
formation and have already structured social institutions (Skowronek 1982). 


The Cultural Dynamics of State-Making 


This approach presents an entirely different view of state-society relations, a 
view that does not invalidate the external geopolitical and internal crises 
arguments but extends them by adding a cultural dimension. State and society 
engaged in a struggle, but this struggle is seen to operate in the cultural arena. 
The state gained power over the population through coercion but was then 
able to gain their consent as well by making them citizens entitled to certain 
rights from the state. The state in this sense is also a cultural creation; in its 
struggle with society it forms institutions that regulate social life and con- 
stantly redefine the boundaries of culture (Nettl 1968, Corrigan & Sayer 
1985). In this relationship the state draws its strength not from its autonomy 
and capacity but rather from the allegiance and identification of the communi- 
ty as a national community which can be as much the result of a state-society 
understanding as of coercion and indoctrination. In the emphasis on the 
cultural practice of the state, state formation gains another dimension in state 
society relations; that of the construction of the cultural context in which 
society operates. Also, as the state is formed, there is a separation between 
religious and secular spheres, which reshapes the cultural context of society 
(Badie & Birnbaum 1983). 

The Negara in nineteenth century Bali was a state that was part of society in 
the sense that it was one institution making political and administrative 
decisions of society. However, the state also constituted the normative order 
within which society is embedded. Although state capacity was maintained 
within a certain structure, the main strength of the state came from its 
symbolized center, which focused on acting out cultural unity. Its domination 
of society then was not expressed in administrative functions but instead in 
exemplifying the notions of kingship to society in the concept of the theater 
state. In Negara ceremony makes the state, differentiates it from society, 
exemplifies rule to society, and thus reproduces itself (Geertz 1980). 
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Research in this area has moved from considering the formation of the state 
as part of a process in which one type of organization asserted itself over 
society by developing its administrative, extractive capacities under external 
or internal predicaments, toward embedding the state into society as one of 
many administrative organizations. Here, the state does not need to assert 
dominance in administrative functions since it constantly reasserts its domina- 
tion over society by cultural or theatrical, symbolic performance. 


STATE FORMATION IN THE NON-WESTERN WORLD 


In 1975 Tilly observed that changes in the world situation have made it "quite 
unlikely that the exact sequences of events we have lovingly reconstructed 
from the European record will ever occur again" (1975:601), and both 
postcolonial history and scholarly research on non-Western states confirm the 
truth of this statement. For one thing, the inernational context within which 
states develop has been dramatically transformed. First, the essence of a 
major aspect of the state-formation hypothesis was the importance of warfare 
in the development of the state's administrative and extractive machinery, but 
the present international norm of nonintervention makes this avenue less 
likely (Migdal 1988). Second, in the competition between states in early 
modern Europe, those state machineries not strong enough to survive col- 
lapsed, whereas the new international system has a tendency to nurture and 
provide legitimacy to weak states (Jackson & Rosberg 1986). Finally, in 
western Europe there was a strong relationship between state makers and 
capitalist development, with capitalism helping to make states and state 
makers helping to make capitalism (Wallerstein 1974, Tilly 1975). But today 
state makers in developing countries are in an unequal and disadvantageous 
relationship with the international capitalist system. 

In addition to differences in the international environment, non-Western 
countries also diverge from early modern Europe in the internal conditions 
affecting the state and state-society relations. While European state makers 
were able to define their boundaries, many Third-World elites found them- 
selves inheriting predetermined state structures and boundaries (Kasfir 1984, 
Mazrui 1986, Kazancigil 1986, Azarya 1988, Callaghy 1988, Migdal 1988). 
European state makers found themselves confronting relatively homogeneous 
societies that were becoming ordered in terms of classes and economic 
interests, whereas postcolonial state elites faced societes that were heteroge- 
neous and divided along tribal, kinship, ethnic, and religious lines (Smith 
1986, Mazrui 1986, Anderson 1986). The emerging class structure of Eu- 
ropean societies facilitated state development and consolidated the position of 
state elites; by contrast, the conditions facing state makers in developing 
countries required them to exert much more effort simply to remain in power 
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(Migdal 1988). To use Mann's categories, state elites in developing countries 
relied primarily on despotic power because their inability to create effective 
state administrative apparatuses hindered the development of infrastructural 
power (Mann 1986b). 

The literature on state formation in early modern Europe acknowledges the 
importance of society in the process, but its primary focus remains the 
expansion of the state. In the non-Western literature the importance of 
state-society relations in state formation is central. Within this literature, 
Anderson's research (1986) comes closest to the European statist approach. 
She pays less attention to society, attributing major social structural and 
political transformations to the historical antecedents of state formation. For 
example, in Tunisia the continuity and stability that characterized state admi- 
nistrations from the Ottoman to the French to the postcolonial eras led to 
state-society relations in which the state restructured and subjugated society 
according to patron-client relations. By contrast, Libya enjoyed neither con- 
tinuity nor stability in state administrations from the Ottoman to the Italian to 
the postcolonial eras, resulting in a Libyan state that was unable to restructure 
society, and in which fragmented patterns of kinship and tribal organization 
provided a continual challenge to state power (Anderson 1986). 

For much of the rest of this literature, society plays a crucial role in state 
formation and actively contests state attempts to reorganize it. In research on 
Zaire, Callaghy characterizes the relationship between state and society as a 
struggle in which a patrimonial-administrative monarchy attempts to increase 
its power internally by trying to dominate society and externally through 
action in the international system (1984a,b, 1988). He argues that the nature 
of state centralization depends on the social structure of society, especially the 
persistence of traditional authority factors (Callaghy 19842). Where society is 
characterized by weak traditional authority bases, the state can penetrate. 
Where society is characterized by "archaic state structures," which are stron- 
ger, the state has difficulty implementing its prefectoral system (Callaghy 
19842). 

Perhaps the finest statement in this literature is offered by Migdal's analysis 
of state-society relations in which the state is weakened by a strong society 
and the capacity of the state translates directly into state strength (1988). The 
strength or weakness of states as they are formed in the postcolonial era is 
based on the particular nature of colonial authority and government. In Sierra 
Leone, where the British pursued a strategy of fragmenting social control at 
the societal level, a weak post-colonial state ensued. In Israel, where the 
British encouraged the consolidation of various societal groups under the 
aegis of a Jewish agency, the end result was a strong state. The international 
context and the global economy are considered to influence this process, but 
they play a decidedly secondary role. 
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Finally, in Azarya's model (1988) of state-society relations, societal re- 
sponses to state actions usually take one of two paths, incorporation or 
disengagement. If incorporation is the primary response, then the state be- 
comes the central organization in which traditional structures and authorities 
are embedded. If society chooses to disengage, then social and economic 
institutions move away from the state and traditional structures retain and 
enhance their salience (Azarya & Chazan 1987). 

Recognizing the historical conditions that placed society in a prominent 
role in the state building process has led to a more balanced view of state 
formation in the developing countries than is found in explanations for early 
modern Europe. A similar attention to state-society relations could provide a 
more nuanced and complete depiction of European state formation. 


ECONOMIC DEVELOPMENT 


Much of the research on the role of the state was motivated by the most 
critical issue facing a majority of non-Western states, that of economic 
development. In these societies, the perception was that the rapid economic 
growth necessary to compete in the world market required planned and 
sustained state intervention (Gerschenkron 1962, Hirschman 1971). The rea- 
son for state intervention varied according to broad levels of industrialization. 
In some countries economic development was relatively advanced but high 
levels of class conflict threatened its progress. The state was then responsible 
for maintaining law and order and regenerating economic growth. In other 
countries class conflict was not the issue because economic development was 
still at an incipient stage. Here the state's primary role was to act às an engine 
for modernization (O'Donnell 1978, Smith 1986). 

Among the most important of the factors contributing to economic growth 
are state autonomy and state capacity, because in these state-planned cases the 
state needs to have control of the development process. In this literature, state 
autonomy and state capacity refer to the internal characteristics of the state as 
well as its relationship to groups in society (Rueschemeyer & Evans 1985, 
Evans 1989). Since states are operating within an international capitalist 
order, the political and market dynamics of this order also affect state 
autonomy and capacity vis-à-vis society (Evans 1981, Evans et al 1985). 

We divide our discussion of the state's role in economic development into 
four categories, using state-society relations as our central criterion. Using 
our categories of state autonomy and capacity we specify the conditions under 
which the state furthered economic development. First we look at states in 
which the state clearly dominated society. These include the Bureaucratic 
Authoritarian states of Latin America and the Middle East as well as of some 
East Asian countries. Second, we examine the literature analyzing cases 
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where a strong state confronts a strong society. These include postcolonial 
states in the Middle East and Asia as well as the newly democratizing 
countries in Latin America. Third, we discuss the literature which focuses on 
weaker states that confront strong societies, primarily those in Africa. Finally, 
we end with states that recently discovered a highly valued resource and 
developed their economies and structured their society around this new 
wealth. 


Strong States, Weak Societies 


The Bureaucratic Authoritarian state represents the ultimate triumph of the 
state over society. It has strength, autonomy, and capacity. What are the 
conditions under which the state acquires autonomy, develops capacity, and 
directs economic growth? In Latin America, the Bureaucratic Authoritarian 
states had initial autonomy because of the class conflict and the economic 
failures of previous regimes (O'Donnell 1978, 1979). By deactivating and 
depoliticizing the popular classes the Bureaucratic Authoritarian state restored 
law and order and thus enhanced its autonomy to implement long-term 
economic and political changes (O'Donnell 1978, 1979, Hirschman 1971). In 
contrast to the Bureaucratic Authoritarian state, which is a reaction to a crisis 
of accumulation, in Taiwan and Korea the state is strong because it was 
organized in the aftermath of war and continues to exist within the context of 
external threat. 

For the Bureaucratic Authoritarian state, initial autonomy results from 
structural class conflict within society that cannot be resolved without outside 
intervention (O'Donnell 1978, Stepan 1978, Rueschemeyer & Evans 1985, 
Waisman 1986, Malloy & Seligson 1987). Initial state autonomy differed 
from country to country, varying according to the preexisting societal con- 
ditions. Where strong divisions existed and class conflict preceded the in- 
stallation of the Bureaucratic Authoritarian state, as was the case in Chile 
prior to 1973, state autonomy was higher. By contrast, where sectoral and 
class conflict had been minimal, as was the case in Peru and Uruguay, the 
state enjoyed little autonomy (Stepan 1978, Collier 1979, Stepan 1985). 
Turkey exemplifies those countries between these poles, with strong sectoral 
conflict providing the Bureaucratic Authoritarian state with the required 
autonomy (Barkey 1990). The East Asian states also possess a high degree of 
autonomy. In Taiwan the state drew its strength from mainland China rather 
than Taiwanese society and therefore had considerable autonomy vis-à-vis the 
latter (Gold 1986, Amsden 1985). 

While autonomy results from state-society interaction, the state's capacity 
to carry out its objectives is a product of its internal institutional structure as 
well as its ability to mobilize external resources such as expertise and capital. 
In these cases where the state is directing economic growth, state autonomy 
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and capacity both must be high for development to succeed. And the record 
suggests that states were indeed able to embark on ambitious plans and carried 
them through at least in part. In Brazil the initial efforts of the state resulted in 
high levels of economic growth. In Chile the state was able to carry out its 
plan of denationalization and the introduction of free market principles and 
practices, even though the strategy ultimately reduced the state's capacity. 
And in Peru the state undertook major economic transformations, including 
land reform (Evans 1979, Stepan 1978, 1985, Rueschemeyer & Evans 1985). 
Like its Latin American counterparts, the Turkish Bureaucratic Authoritarian 
state also assumed power because of an economic crisis, but it is argued that 
this crisis was brought about by the previous state's lack of autonomy to 
implement its economic policies, rather than by the failures of the policies 
themselves (Barkey 1989, 1990). Under these conditions the new state was 
able to acquire and maintain autonomy and capacity and to carry out the same 
economic policies as its predecessor, but to much better effect. 

While there is agreement in the literature that these states dominate their 
societies, the specifics of the state-society compact vary across cases and 
affect the state's ability to carry out its projects. For Chile, for example, 
scholars agree that continued class unrest as well as harsh repression together 
with large infusions of foreign capital were crucial in determining state 
control (Stepan 1985, Garreton 1986). A case where societal groups resisted 
state intervention is presented by Stepan, who argues that Peruvian society 
resisted the intense corporatist restructuring attempted by the Bureaucratic 
Authoritarian state (1978). While the literature on Brazil seems to confirm 
that initially economic growth maintained the state in legitimate control over 
society (Stepan 1985, Cohen 1987, 1989), most scholars add repression and 
corporatism to the list of mechanisms of state control and societal reorganiza- 
tion. Cohen, however, presents evidence that state elites used corporatist 
structures to diffuse state ideology and to carefully manipulate working class 
support, exemplifying a case where the power of state elites was based on 
consent rather than coercion (1989). Finally, the East Asian cases offer 
examples where existing societal norms and structures aided the state. In 
Taiwan, the near-total absence of a rural elite allowed the state to use 
agricultural taxes and surplus production to finance industrialization without 
significant opposition (Amsden 1985, Gold 1986). In Korea, the high status 
and autonomy historically accorded to the bureaucracy allowed the state to 
recruit members of traditional elites to its service and then demand their 
allegiance to state goals rather than those of their own groups (Evans 1989). 

This set of cases demonstrates that despite variations in the levels and types 
of state strength, autonomy, and capacity, the specific characteristics of 
economic development, even when state directed and controlled, are ul- 
timately shaped by the nature of state-society relations. 
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Strong States, Strong Societies 


We now turn to those cases that illustrate the nature of state-society interac- 
tion when the state is strong but societal groups are already strong or are 
gaining strength vis-à-vis the state. In some countries such as Mexico, India, 
and Egypt the state's prominent role in the economy is due to its political 
leadership during revolutions and independence movements; however, its 
autonomy is constrained by preexisting societal formations. In others, such as 
the newly democratizing post-Bureaucratic Authoritarian states, the state 
continues to maintain control over large parts of the economy but it is 
increasingly challenged by a newly reorganizing society asserting its right to 
political participation. These two types of states have strong internal capacity, 
but their autonomy from society is restricted. 

Scholars of India and Mexico are confronted with an apparent contradic- 
tion: all agree that the state plays a prominent role in economic development, 
but it confronts strong and highly organized societal groups with their own 
economic interests. Not surprisingly, therefore, these scholars differ on the 
level of autonomy they accord the state. Kohli (1987) denies the Indian state 
any autonomy, arguing that it is weak and "captured," whereas Bardhan 
(1984) argues that it is able to formulate goals and policy directions in- 
dependently of the interests of propertied classes. Rudolph & Rudolph (1987) 
eschew the category altogether, characterize the Indian state as semi- 
autonomous or "constrained," and treat it as a third actor that influences 
economic development along with labor and capital. All agree, however, that 
while the state is able to formulate and implement economic strategies with 
some success, its ability to carry them out fully is hampered by the societal 
pressures generated by a pluralist open polity. For Mexico, Grindle (1986) 
and Bennett & Sharpe (1980) assert that state planning and implementation of 
economic policy and its reliance on and building of the private sector are the 
main reasons for its continued ability to direct development. Hamilton pre- 
sents a dissenting view that the Mexican state was less able to control and 
build the private sector since foreign capital and business elites often restric- 
ted its autonomous decision-making process (1982). 

Finally, the nature and capacity of the Egyptian state has been the subject of 
a vigorous debate in which authors have concentrated on different aspects of 
state action and their outcomes (Waterbury 1983, Migdal 1988, Binder 1978, 
Ansari 1986). On the one hand, Waterbury declares the Nasserite state 
autonomous from the dominant classes, in the manner of a classic bureaucra- 
tic-authoritarian state. Extraneous factors, the socioeconomic characteristics 
of Egypt (illiteracy, poverty, demography) and its dependence on external 
sources of finance account for the state's inability to implement its policies 
and spur economic growth (Waterbury 1983). On the other hand, some have 
stressed the nonautonomous captured nature of the state, arguing for a domi- 
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nant rural middle class infiltrating the state and therefore controlling the 
crucial stage of implementation (Binder 1978, Ansari 1986). Migdal assumes 
a moderately weak Egyptian state which is able in the initial thrust to 
implement reforms that end up destroying the existing structure, a structure 
that is rebuilt according to dominant local interests (1988). The strong society 
approach is corroborated by Springborg's study of informal regional networks 
that might promote class interests against states (1982). 

The current trend toward redemocratization in countries formerly con- 
trolled by Bureaucratic Authoritarian states reflects the successful movement 
by societal groups to reorganize themselves and to reassert their economic and 
political interests, thus limiting the state's autonomy. The recent literature on 
redemocratization is focused on two issues: first, the conditions that lead to 
the demise of the Bureaucratic Authoritarian state and the beginning of 
redemocratization (O'Donnell et al 1986, Malloy & Seligson 1987, Fishlow 
1989); and second, the effects of these developments on societal reorganiza- 
tion and the state-society relations that result (Stepan 1989). Scholars agree 
that redemocratization results from the reorganization of civil society. 
Churches, grass-roots organizations, and labor unions all play major roles in 
pressuring the state to liberalize and make concessions to society (Garreton 
1986, Mainwaring 1989, Keck 1989). While the state continues to be a major 
force for political and economic action, it is obliged to respond to the 
demands of reemergent classes, and a result is the decrease of state autonomy 
as well as state capacity. However, all classes do not appear to have equal 
strength or access to the state. In particular, the bourgeoisie, whose accep- 
tance of state autonomy was crucial in the establishment of the Bureaucratic 
Authoritarian state, continues to figure largely in this period (Stepan 1989). 
As Bermeo has noted in a recent review essay (1990), all bureaucratic 
authoritarian states were followed by center-right coalitions which have 
avoided radical changes. 

These cases demonstrate that even strong states with a degree of autonomy 
and the capacity to formulate and carry out economic strategies are con- 
strained if society is also strong. Thus, the economic development programs 
that are adopted cannot be determined solely by the state but instead are the 
outcome of negotiated state-society relations. 


Weak States, Strong Societies 


In this section we turn to states characterized in the literature as weak, but 
which confront societies at least as strong as the state or stronger. These states 
did not come into being as a response to economic crises, but were born out of 
revolutions, independence movements, or abrupt transfers of power (Callaghy 
1988). The societies that the new states led had been restructured by colonial- 
ism, but they had not usually engaged in direct state-society interaction 
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(Migdal 1988). These societies entered the postcolonial era as highly differ- 
entiated entities where traditional categories such as ethnicity, tribe, and 
various cross-cutting clientelistic networks predominated. Incipient classes 
based on shared economic interests had emerged during colonial rule, but 
the older categories continued to have mobilization potential (Kasfir 1984, 
Migdal 1988). Thus, the class-based alliances that had facilitated economic 
development in the west were weak, while traditional social cleavages 
not usually associated with state-led, national economic growth strategies 
were strong. : 

Scholars of these postcolonial states of Africa and Asia agree that they are 
weak but still retain some limited autonomy. How can we account for this 
autonomy? The literature suggests two sets of answers, one focusing on 
state-society relations and the other on the international context. The frequent- 
ly observed sources of autonomy are the state's monopoly of force and 
internal societal divisions. Scholars point to Uganda and Chad as states that 
retain power through sheer coercion, using what Mann terms despotic power 
(Mann 1986b, Jackson & Rosberg 1986). In states whose monopoly of force 
is not absolute, however, such as Ghana and Cameroon (Forrest 1988, Azarya 
& Chazan 1987, Bayart 1979) internal societal conflict often provides the 
opportunity for an autonomous state that can mediate between warring fac- 
tions (Kasfir 1984, Forrest 1988, Callaghy 1984b, Azarya & Chazan 1987, 
Young & Turner 1985, Bayart 1984.) 

On the other hand, scholars argue that the state draws autonomy from its 
international context. First and most importantly, continuing ties to the former 
colonial powers and their relations with international agencies and multi- 
national corporations have provided these states with financial and political 
resources to enhance their autonomy (Kasfir 1984, Fatton 1988). Most Afri- 
can states have not been able to maintain autonomy from the international 
system; they have nonetheless preferred foreign to domestic dependence 
(Cooper 1981, Callaghy 1984a,b, Fatton 1988). Second, the international 
system's need for states as juridical categories also provides legitimacy to 
particular weak African states that might otherwise disintegrate (Jackson & 
Rosberg 1986). Finally, unexpected discoveries of abundant resources or the 
sudden market demands for particular commodities can boost the state's 
autonomy so long as the state has mechanisms to appropriate the wealth 
(Kasfir 1984, Parson 1984, Azarya & Chazan 1987). 

While agreeing that these weak states possess a degree of autonomy, 
scholars point to the relative lack of state capacity. The bureaucratic 
machineries of these states are often based on patronage and tend to be 
overdeveloped in size, inadequate in expert knowledge, and lacking in finan- 
cial resources. Moreover, the military as an organization reflects traditional 
kin and clientelist structures, and these prior loyalties are not transformed by 
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professional training that emphasizes allegiance to the state (Chazan et al 
1988). As a result, the bureaucratic and coercive apparatuses of the state are 
unable to increase its capacity. 

The literature on economic development illustrates that the low level of 
state autonomy and the lack of capacity make it difficult for the state to direct 
economic growth. Some African states do stand out as exceptions to this 
pattern, however. The Ivory Coast state, lacking abundant natural resources 
or plentiful foreign capital, was forced to implement development strategies 
that relied on its rural sector, and so it turned to state-directed agricultural 
capitalism to achieve impressive economic growth (Berg-Schlosser 1984, 
Callaghy 1988). Kenya reached similar levels of development in the agri- 
cultural sector by drawing externally on large amounts of foreign capital and 
internally by encouraging the development of a national bourgeoisie (Young 
1982, Callaghy 1988). Nigeria, on the other hand, was blessed with a natural 
resource, oil, that could be exploited to great effect, although the ease with 
which growth could be achieved led the Nigerian state to neglect all other 
economic sectors (Young 1982, Callaghy 1988). 

More frequently, however, states with little autonomy and capacity vis-à- 
vis their societies have been concerned primarily with ensuring that the ruling 
elite remains in power (Callaghy 1984b, Schatzberg 1980 ). These states' 
economic policies have gone beyond neglect to the active discouragement of 
economic development in particular sectors, as in the Ghanaian state's 
ambivalent attitude toward cocoa despite that product's contributions to state 
revenue (Lonsdale 1981). When the state does attempt to intervene in the 
economy, its efforts are often designed more to secure resources for itself than 
to implement coherent economic strategies. Interventions in the agricultural 
sector in tropical African countries in particular have resulted in food crises 
for the peasantry (Bates 1981). Under these conditions, it is not surprising that 
society frequently resists state policies. In Uganda, the state was unable to 
capitalize on increased production of cash crops because coffee farmers used 
their bargaining position to develop autonomy vis-à-vis the state (Bunker 
1987). 

The literature on these states illustrates how difficult economic develop- 
ment is in countries whose states lack capacity, even if they have a degree of 
autonomy. The problem is compounded by state-society relations, since 
society’s prior social organization provides the bases of resistance to state 
intervention. 


Rentier States 


Yet another category of states is distinguished from the others by their 
resource structure. As mentioned in the case of the rich state of Nigeria, these 
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states develop, acquire their autonomy, and in turn make economic policy as a 
result of the discovery and exploitation of a major resource. First and fore- 
most, they are states whose allocative task makes them viable entities. For the 
richest ones the extractive apparatus of the state does not exist. Two im- 
mediate conclusions can be drawn from this; the first, that unlike states which 
developed their extractive capacities first, these states do not have well- 
developed administrative machineries (Beblawi & Luciani 1987, Chaudhry 
1989). An important part of their functional arm is missing, therefore; if their 
resource base dries up, their viability as states is severely threatened. Second, 
state-class relations can take entirely different formats in these societies. The 
argument has been made that in the absence of extractive tasks, there are no 
class relations between the state and the population (Bukharin 1965, De- 
lacroix 1980). These societies then cannot be organized and cannot challenge 
the state along class lines. If the country is small, and the rest of the economy 
is not well-developed (Mahdavy 1970), the likelihood of organization along 
class lines is low. Hence, in the absence of class, other bases of association 
will be salient, and the more recent the incorporation into the world system, 
the more likely that the redistributive state will encounter "tribal, ethnic and 
religious challenges" (Delacroix 1980). 

Distributive states are states with high degrees of autonomy acquired 
largely from their resource base and the resulting economic development. 
States that do not need to extract are not dependent on their society (Mahdavy 
1970, Anderson 1987, Chaudhry 1989) and therefore have much more leeway 
in implementing economic and social policies of their choice. The state has 
been involved in the economy at different levels ranging from acting as a 
simple allocative agent to a builder of infrastructure or an active competitor in 
the market (Chatelus 1987). The tendency also exists for the government 
bureaucracy to turn into a rentier class geared toward the collection of 
revenues, establishing an independent source of income with relatively little 
concern for economic development. Most of the arguments about prerevolu- 
tionary Iran center around this theme of a large and populous country with 
limited development of certain sectors of the economy coupled with the 
enrichment of a rentier class (Mahdavy 1970, Halliday 1979, Skocpol 1982). 
Two other cases are considered to be classic in their combination of financial 
autonomy with some redistribution and some coercion; small as well as very 
rich, Qatar and Kuwait have states that enjoy oil wealth and let a rather wide 
segment of the population enjoy it as well. Kuwait, for example, has a highly 
developed oil industry, the revenues of which are redistributed by the state to 
the Kuwaiti population as well as to the "exploited" class, which consists of 
well-paid guest workers. But it is also a despotic state which relies on some 
level of coercion to ensure its survival (Khouja & Sadler 1979, Delacroix 
1980). Recently most wealthy Arab states, especially Qatar, Kuwait, the 
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United Arab Emirates, and Saudi Arabia, have undertaken policies of "sub- 
sidized privatization" and have therefore encouraged the development of an 
entrepreneurial class (Chatelus 1987). 

Economic prosperity has helped these states dominate their societies. Yet, 
periods of economic bust demonstrate their fragile hold over societal groups. 
Financial autonomy is rewarding during periods of economic boom because 
the states that develop under circumstances of high external capital inflows 
acquire mechanisms to handle periods of growth and accumulation stimulated 
from the outside, and they have the ability to transfer their gains to the 
society. When economic crises hit, states that have only financial autonomy 
are confronted by their inability to extract, as well as by societal forces that 
resist extraction. Recently an argument has been made that autonomy depends 
not on resources generally, but rather on the type of resources and the 
structure of the resource acquisition and reallocation (Chaudhry 1989). The 
comparison of Yemen to Saudi Arabia, two states equally affected by the 
economic crises of 1983-1987; demonstrates that the nature and degree of 
state-society relations as structured by the type and allocation of resources had 
a strong impact on the state's ability to implement reforms (Chaudhry 1989). 
Thus, a bonanza such as oil does not necessarily provide a state with direct 
autonomy over society; autonomy has to be acquired through the structuring 
of state-society relations. 


SOCIAL MOVEMENTS, ETHNIC GROUPS, 
AND THE STATE 


While the role of the state has figured prominently in scholarly research on 
economic and political development for some time, only recently has it 
become a central factor in the study of social movements and ethnicity. 
Initially, scholars treated both of these phenomena as internally determined. 
Psychological explanations of social movements and revolutions emphasized 
that they originated in the frustration of individuals against social changes 
such as industrialization or urbanization and represented a nonrational, dis- 
organized reaction to grievances (Smelser 1962, Gurr 1970). Analogously, 
ethnicity and ethnic mobilization were viewed as primordial sentiments that 
bound people together in premodern communities; ethnicity forged ties of 
emotion rather than of interest and would be replaced during modernization 
by loyalties to class and to nation (Geertz 1963, Lipset & Rokkan 1967). 
These theories were eventually supplanted by theories that continued 
to focus on society but portrayed social action differently. Scholars now 
argued that social movements and ethnic identity were not irrational or emo- 
tional but rather were rational means of organizing to achieve collective 
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benefits. Resource mobilization theory posits that the likelihood of social 
movements rests not on the grievances of a group but on the group's improved 
access to external resources such as income and information (McCarthy & 
Zald 1977, Jenkins 1983). Political opportunity theorists tend to view external 
resources as problematic and instead attribute success in organizing for and 
achieving benefits to internal resources (Tilly 1978, McAdam 1982). For 
example, the civil rights movement of the 1950s and 1960s was attributed to 
increased internal resources such as the development of networks among 
university students and the power and legitimacy of the black church (McAd- 
am 1982, Morris 1984). Ethnic identity formation and ethnic mobilization 
were also perceived to be strategic and “situationally activated" (Nagel 1986), 
so that changes in economic or political conditions led to the selection of new 
or resurgent ethnic categories (Young 1976). Thus, the division of Nigeria 
into three large regions created new regional-level identities where smaller 
community and kin-level ties had predominated (Nagel 1986). This more 
instrumental view of ethnic identity acknowledged the effect of external 
factors, such as political and economic policies, but concentrated the an- 
alytical attention on the internal dynamics of the group. In social movements 
this focus meant how the group developed resources (McAdam 1982, Morris 
1984), and in ethnicity it meant how ethnic boundaries arose or adapted to 
changing external political and economic factors (Horowitz 1985, Laitin 
1986). 


The Role of the State 


Application and reformulation of these theories made it clear that internal 
ethnic and social dynamics could not be fully understood without analyzing 
external factors on their own terms, especially the role of the state. It had 
already become evident that state policies constituted one of the major 
determinants of mobilization and shifting identity patterns (Hechter 1975, 
Tilly 1978, Olzak 1983, Nagel 1986, McAdam et al 1988, Cornell 1988). A 
series of questions then followed: What were the conditions under which the 
state influenced social movements and ethnic identity formation and mobiliza- 
tion? What were the intended and unintended consequences of state action? 
How did the state as an institution affect social processes? Was the state, as 
some scholars argued, primarily an arena in which mobilization occurred, a 
force that unintentionally restructured social relations, or was it an autono- 
mous actor with interests of its own that intentionally encouraged some social 
and ethnic movements over others in order to achieve its own goals? And just 
as important, were there patterns to state action that scholars could identify 
across cases to develop a more general theory of state influence on social 
movements and ethnicity? 
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THE STATE'S INFLUENCE ON SOCIAL MOVEMENTS 


In much of the research, the state's role is often characterized in one of the 
following three ways. First, the state is viewed as possessing resources for 
which social movement organizations and ethnic groups compete; it therefore 
has the power to structure relations between groups and itself as well as to 
transform the character of the groups through its actions (Tilly 1978, McAd- 
am 1982, Bright & Harding 1984). For example, if resources such as affirma- 
tive action positions are distributed along certain ethnic boundaries and not 
others, such as “Hispanic” rather than “Mexican” or “Puerto Rican,” it will 
encourage groups to use the new identifiers rather than previously salient 
ones; this could eventually transform the nature of the group (Nagel 1986). 
Second, states may implement policies that favor one group over another; the 
US Internal Revenue Service’s category of tax-exempt status favors organiza- 
tions that can be classified as not-for-profit and puts those whose activities 
cannot be so categorized at a relative disadvantage (McAdam et al 1988). 
Finally, the state may blatantly affect social movement and ethnic mobiliza- 
tion by repressing certain groups while tolerating or encouraging others 
(Enloe 1981, McAdam et al 1988). Indira Gandhi’s encouragement of 
Sikh extremists in an attempt to undercut the strength of the moderates and 
increase her own power resulted in the rise of radical, secessionist Sikh views 
and the decreased likelihood of a compromise solution to the dispute (Brass 
1990). 

However, while scholars positing these characterizations of the role of the 
state make clear its importance in influencing the shape and actions of social 
movements and ethnic groups, their primary interest remains the ethnic group 
or social movements, not the state. As a result, many aspects of state action 
are left unspecified. In most of these cases, the state could either be acting in 
its own interest or in the interests of a particular class; it could be responding 
to situations or creating them. With these approaches it is usually difficult to 
tell because the state itself is not the object of investigation. Frequently, the 
effect of the state’s actions on ethnic groups or social movements is character- 
ized as “unintended,” which leaves the impression that the state has little 
interest or ability to devise and carry out policies aimed specifically at ethnic 
groups or social movements (Nagel & Olzak 1982, Olzak & Nagel 1986). An 
important exception to this tendency is Birnbaum (1988), who examines how 
states shape collective action, both intentionally and unintentionally. While it 
is possible to draw general inferences about the state’s influence from these 
studies, they tend to remain at the level of the political opportunity literature 
(Tarrow 1988), in which the state is important primarily as an objective 
influence on the movement but is not investigated on its own, and state 
autonomy and capacity are not specifically examined. 
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STATE INTERVENTION IN ETHNIC 
AND COLLECTIVE ACTION 


The predominance of society-centered explanations has been supplemented in 
recent years by a focus on the state that characterizes it as autonomous, that is, 
as possessing separate interests, as well as having the capacity to achieve 
these interests. In this research the state is usually strong and capable, and it is 
seen as having an explicit interest in or conception of ethnicity or collective 
action that shapes its behavior. Not surprisingly, among the earliest states 
recognized as possessing interests separate from those of the society they 
ruled were colonial states, since their autonomy and legitimacy were not 
contingent on their relations with the colonized society. The nature of colonial 
rule led to one of two strategies: indirect rule as pursued by the British, which 
relied on indigenous power structures, or a more centralized style of rule, as 
preferred by the French. 

Ironically, the desire to rely on indigenous political organization could have 
a more profoundly transforming effect on ethnic groups than did the more 
intrusive style of centralized government. In Nigeria, the British desire to rule 
indirectly through existing hierarchical power structures and their conviction 
that Nigerian society was corporately organized into tribes led them to create 
new leaders and power relations if they did not exist, a policy that eventually 
helped transform the basis of Nigerian ethnic identity (Laitin 1985, 1986). 
Similarly, in India the favoritism shown to Muslims by the British Raj and the 
Raj's insistence on seeing religious groups as irrevocably divided led Hindu 
elites to mobilize their own population, even though strong religious cleav- 
ages had not historically divided the two groups; by the time of independence 
the British view of the two groups was widely accepted and Muslims and 
Hindus were so far apart that Partition appeared as the only solution (Robin- 
son 1977, 1979; Brass 1977, 1979). 

While colonial empires provide a ready example of state manipulation of 
ethnic and other social identities, states today also use such strategies to 
control populations within their own borders. For example, the Israeli state 
has helped to recreate and sustain traditional Arab elites, which both wins 
their allegiance and undercuts other emergent elites, especially more radical 
ones (Laitin 1985, Lustick 1980). 

A second area of research that examines how the state consciously in- 
fluences social movements and mobilization potential stresses the cultural and 
ideological assumptions inherent in state actions and how these assumptions 
can reshape ethnic identities and social movements. In research on the 
emergence of a pan-Alaska identity and mobilization effort by Alaska's 
indigenous groups, Darnell (1990) has found that the state's own beliefs about 
how indigenous groups should be incorporated into American society contrib- 
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uted to unifying previously disparate communities and eventually Jed them to 
call themselves Alaskan Natives first and Inuits, Aleuts, or Athabascans 
second. He emphasizes that the state's ideological preconceptions can lead it 
to pursue policies without considering how such ideologically based policies 
will affect its material interests. In an analysis of the Nazi German state, 
Geyer similarly finds that ideology plays a major role in shaping the state's 
actions toward ethnic groups. The Third Reich's obsession with racial purity 
permeated policy decisions and was instrumental in its attitude toward Ger- 
man Jews and other non-German people. In this case ideology was not merely 
a way to rationalize state policies based on other criteria; rather, it was a 
fundamental force directing state treatment of Jews and ultimately had horrific 
consequences for that group (Geyer 1984). In these cases the state possesses 
attributes similar to the colonial state in that it is able to exert dominance over 
the group it targets. 

Finally, recent work on social movements draws on the new institutionalist 
literature to emphasize the role of multiple state actors in different parts of the 
state. In his work on the environmental movement Gale illustrates how 
bureaucrats and agencies diverged in their positions; indeed, some parts of the 
state were promoting the environmentalists while other parts were promoting 
their opposition (1986). In research on the rise of the Townsend old-age 
pension movement in the 1930s, Amenta & Zylan found that the movement 
flourished in states with agencies that were in charge of labor legislation 
(1991). In both of these approaches, the state is not considered as a unitary 
actor, but instead is seen as consisting of an overlapping set of institutions and 
organizations, all of which may not have the same interests (DiMaggio & 
Powell 1983, March & Olsen 1984, 1989, McAdam et al 1988). Equally 
important, these states are seen as exerting influences even if they are not as 
strongly autonomous as the states discussed above. 

The literature on states, social movements, and ethnicity has come far in 
taking state structures and actions into account, but it still has far to go. By 
and large, scholars continue either to treat political conditions as objective and 
unexamined when discussing their influence, or they recognize the state when 
it is strong and autonomous from the group. In either case, the group remains 
the primary object of inquiry. The field still needs research on state-society 
interaction, how social and ethnic movements shape the state, and how less 
strong, less autonomous states affect social movements and ethnicity. 


CONCLUSION 


Even in "state-centered" analysis, the role of society and the nature of the 
state-society relationship has always received attention, and the trend in this 
direction is increasing. This is not surprising, since variables such as 
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autonomy, capacity, and state strength are usually defined in relation to 
society. Therefore, the term "state-centered" is of little value in specifying the 
perspective being used. Statist analyses take account of society, and society- 
oriented analyses recognize the importance of the state. A related issue that 
needs to be addressed is the way in which the state is being operationalized. 
Simply put, is the state an independent or a dependent variable in the 
research? The same studies often operationalize the state in both ways without 
separating and explaining the differences between the two, and thus the 
conceptual contributions are beclouded. 

An encouraging aspect of recent developments in this literature has been 
the breaking down of established dichotomies. Marxist categories are being 
used within a Weberian framework of state action (Tilly 1975, Skocpol 1979, 
1980, Mann 1986a,b). Comparisons are being undertaken that transcend the 
traditional schism between Western European and developing countries (Cal- 
laghy 1984a, Migdal 1988, Barkey 1988, Parikh 1990). Promising theoretical 
developments are arising from empirically grounded, historically specific 
comparative and case studies, which further negate the distinction between 
ahistorical theory building and atheoretical empiricism (Skocpol & Amenta 
1986, Ragin 1987). 
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